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STARCH DEBRANCHING ENZYMES 

TECHNICAL FIELD 

The present invention relates to enzymes having starch 
debranching activity. It further relates to nucleic acid 
encoding such enzymes, and methods of producing and using 
such enzymes and nucleic acid. 

PRIOR ART 

Starch is composed of highly branched (amylopectin) , and 
lightly branched (amylose) glucan polymers arranged in a 
three-dimensional, semicrystalline structure, the starch 
granule. The degree of branching of amylopectin and the 
spatial orgauiization of branches within the starch 
granule are very important in determining the physical 
properties of the starch and hence its value as a raw 
material for industry. The traditional view is that the 
branching pattern of amylopectin, and hence the way in 
which it is organised to form a granule, is determined by 
starch-branching enzymes which cleave short glucans from 
the non-reducing ends of chains and join them to residues 
within the same or an adjacent chain via Qf(l-6) linkages 
to form branches. There is, however, increasing evidence 
that the branching pattern of amylopectin results from 
the combined actions of branching and debranching 
enzymes . 

'^Debranching enzymes" hydrolyse a (1-6) glucosidic 
linkages in glucans. In plants, two quite different types 
have been described: 

The ''pullulanase" (SC 3.2.1.41) type is widely 
distributed in starch-degrading organs and in the 
chloroplasts of leaves. It is capable of the hydrolysis 
of the a (1-6) linkages of pullulan, amylopectin and a- 
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limit dextrins, but usually cannot hydrolyse glycogen. 

The second type of debranching ienzyme, the **isoamylase" 
(EC 3.2.1.68) type, has been described only in potato 
5 tubers and maize endosperm, but this is probably because 

there is, at the moment, no specific assay for isoamylase 
activity in crude extracts (i.e. where other hydrolysing 
enzymes may be present) . It can hydrolyse the a (1-6) 
linkages of amylopectin, glycogen and a-limit. dextrins, 
10 but. not pullulan. 

Evidence that debranching enzymes may be involved in 
determining amylopectin structure comes from analysis of 
the sugary (an 1) mutant of maize (Pan and Nelson 1984, 

15 James et al . 1995), the sugary mutant of rice (Nakamura 

et al. 1996a) and the STA 7 mutant of Chlamydomonas 
(Mouille et al. 1996) . All three mutations reduce or 
eliminate synthesis of conventional starch and cause the 
accumulation of a highly-branched, water-soluble glucan 

20 known as phytoglycogen . This change is accompanied by a 

reduction in the activity of debranching enzymes. In 
both maize and rice endosperm the activity of the 
pullulanase type of debranching enzyme is decreased, and 
in Chlamydomonas the activity of a debranching enzyme of 

25 unlcnown type disappears. In general terms, therefore, 
these phenotypes suggest that debranching enzyme is 
involved in determining the structure of amylopectin. 
However, understanding of the mutant phenotypes is far 
from complete. 

30 

Before the priority date of the present application, the 
sul locus from maize had been shown to encode a 
polypeptide which is very similar in amino-acid sequence 
to the bacterial isoamylase type of debranching enzyme, 
35 and not to pullulanases (James et al. 1995). Note, 
though, that the 5' end of the sequence was not 
necessarily complete in this publication. No effect of 
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the mutation on isoaraylase activity in the endosperm was 
reported. The way in which the mutation brings about a 
decrease in pullulanase activity, and the relationship 
between this decrease and the accumulation of 
phytoglycogen were also not known. 

After the priority date of the present application, 
nearly full-length maize SUl was expressed in E. coli and 
purified. The recombinant enzyme was classified as an 
isoamylase (Rahman et al, 1998 Plant Physiol 117: 425- 
435) . 

Neither the rice nor the Chlamydomonas mutations have 
been fully characterised. In the former case, it has 
been established that the gene at the sugary locus does 
not encode the pullulanase that decreases in activity in 
the mutant endosperm (Nakamura et al. 1996b) . In the 
latter case, the nature of the gene at the STA7 locus is 
not known. 

The general effects of these mutations form the basis for 
a new model to explain the synthesis of amylopectin and 
its organisation to form a granule (Ball et al. 1996) . 
Briefly, it is proposed that debranching enzyme acts to 
"trim" a highly- branched phytoglycogen-like structure 
synthesised at the periphery of the growing granule. 
This creates the branching pattern typical of amylopectin 
which, unlike the branching pattern of phytoglycogen, 
allows the polymer to pack in an organised manner to form 
the semi -crystalline matrix of the granule. 

.A critical assessment of the validity of this model is 
not yet possible, in part because of the lack of 
understanding of the mutations on which it is based, and 
in part because of the lack of information about 
debranching enzymes generally, and in starch-synthesising 
organs in particular. The nature, number and 
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intracellular location of proteins with debranching 
activity is not known for any starch- synthesising organ, 
and sequences have been reported for only one plant 
isoamylase (the sul gene product) and a very few 
pullulanases . It is not known whether either isoamylase 
or pullulanase actually have the properties and 
specificities required by the Ball model. 

Regardless of the validity of the Ball model, it seems 
highly likely that debranching enzymes play an important 
role in determining amylopectin structure, and hence in 
determining the physical properties of starch. The fact 
that the sul gene encodes an isoamylase suggests that 
this type of enzyme in particular may be involved. The 
decrease in pullulanase activity in the sul and sugary 
mutants also implicates this type of enzyme, and it has 
been reported (J. Kossraann and colleagues, MPI-MPP, Golm, 
Germany; verbal reports at open meetings) that 
modification of pullulanase activity in potato tubers 
brings about changes in the physical properties of the 
tuber starch. 

Patent application WO 95/04826 [Kossmann et al] relates 
to a debranching enzyme obtained from potato. From the 
purification procedure used to obtain the amino acid 
sequence information it would appear that this relates to 
a single enzyme of the pullulanase type. 

Patent application WO 95/03513 [Barry et al] relates to 
an isoamylase obtained from f lavobacterium spp . The 
application does not disclose any corresponding enzymes 
or sequences from plants. 

It can thus be seen that novel starch debranching 
enzymes, particularly those from plants, and particularly 
isoaraylases, may provide a useful contribution to the 
art. 
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DISCLOSURE OF THE INVENTION 

In a first aspect of the invention there is disclosed an 
isolated nucleic acid which comprises a nucleotide 
5 sequence which encodes a polypeptide which has the 

properties of an isoamylase, and is obtainable from 
Solanum tuberosum . 

Preferably the nucleic acid molecule has the sequence 
10 shown in" any of Seq ID Nos 1 to 3 or is degenerative ly 
equivalent or complementary thereto. 

Seq ID Nos 1 to 3 (Figs 1 to 3) represent nucleotide 
sequences derived by the present inventors from cDNA 

15 clones (designated 21, 15 and 9 respectively) from potato 

txabers and minitubers . Clone 15 came from a minituber 
library; clone 9 from a tuber library and clone 21 was 
found in both types of library. Each of these clones 
encodes all or part of an independent novel starch ' 

20 debranching enzyme. 

The amino acid sequences for clones 21, 15 and 9 are 
given as Seq ID Nos 4-6 (Figs 4-6) respectively 

25 The original nucleotide sequences for clones 21, 15 and 9 

which were determined initially by the inventors are 
given as Seq ID Nos 10-12 (Figs 10-12) respectively. 
Owing to very minor variations in the sequencing process 
these differ at a very few positions from the sequences 

30 above: however in the case of clones 21 and 15 there is 
in excess of 99.5% identity between new and old 
sequences. Clone 9 has also been extended at its 3' 
terminus (still in excess of 99% identity) . 
Corresponding amino acid sequences are at Seq ID Nos 13- 

35 15 (Figs 13-15) respectively. 
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Table 1 





Similarity 


Identity 








C9 


63 


46 


CIS 


82 


71 


C21 


5.8 


35 


IsoDsean 






C9 


53 


32 


CIS 


54 


31 


C21 


48 


23 


Klepn 






C9 


46.5 


21.6 


CIS 


48.5 


23.2 


C21 


50.7 


22.3 


Kleae 






C9 


47.3 


21.6 


CIS 


45.6 


21.9 


C21 


49.1 


21.8 


Sopulspo 






C9 


49.4 


26.8 


CIS 


43 .6 


22.1 


C21 


46.6 


21.8 


Puli 






C9 


49.1 


27.3 


CIS 


50 


26.6 


C21 


49.1 


22.6 



All of these sequences are unique, but show significant 
similarity at the level of predicted amino-acid sequence 
to the sul gene product of maize and the isoamylases of 
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mi cro - organi sms . 

A comparison of the clones was made with isoamylases 
(Table 1 - above) ; namely the sul gene (Sulzmay - EMBL Ac 
No U18908) and Isopseam (EMBL Ac No J03871; M28370) which 
is bacterial. Also with Pula_klepn (EMBL Ac No X52181; 
M32702) and Pula_kleae (EMBL Ac No M16187) which are 
bacterial pullulanases . Plus Sopulspo (EMBL Ac No X8396 9) 
which is a pullulanase from spinach. 

Conserved domains I to IV occur in the amino acid 
sequences of clones 21, 15 and 9 at the following 
positions (Table 2) : 



Table 2 



Domain 




I 


II 


III 


IV 


C9 


352-357 


426-434 


467-470 


535-540 


C15 


344-349 


415-423 


475-478 


543-548 


C21 


455-460 


515-523 


556-559 


623-628 



This domain structure is typical of isoamylases; domain 
III does not occur in pullulanases . 

The relationship between the various sequences is best 
illustrated by means of the dendogram (Fig 9) which shows 
the debranching enzymes in 2 groups, with clones 15, 9 
and 21 all aligning with the isoamylases. Clone 15 is 
most similar to the Sul gene. 

Since the clones show greater similarity to the 
isoamylase sequences, they have been putatively 
identified as isoamylases, and they are described as, such 
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hereinafter; however it will be understood by the skilled 
person that the essence of the present invention is the 
making available of novel starch debranching enzymes, for 
instance for some of the purposes listed below, and this 
5 contribution to the art would not be diminished should 

the enzymes have properties not wholly consistent with 
the isoamylases described in the prior art. Indeed 
different properties and/or specificities may be 
advantageous for certain applications . 

10 

Thus the present inventors have for the first time 
demonstrated the existence of multiple forms of 
isoamylase in the potato tiiber. 

15 The nucleic acid molecules or vectors (see below) 

according to the present invention may be provided 
isolated and/or purified from their natural environment, 
in substantially pure or homogeneous form, or free or 
substantially free of nucleic acid or genes of the 

20 species of interest or origin other than the sequence 

encoding a polypeptide with the required function. 
Nucleic acid according to the present invention may 
include cDNA, RNA, genomic DNA and may be wholly or 
partially synthetic. ^Nucleic acid" and '^nucleic acid 

25 molecule" have the same meaning: 

The term "^isolate" encompasses all these possibilities. 
Where a DNA sequence is specified, e.g. with reference to 
a figure, unless context requires otherwise the RNA 
30 equivalent, with U substituted for T where it occurs, is 

encompassed. 

Thus nucleic acid according to the present invention may 
comprise the sequence or complement of any one of Seq ID 
35 Nos- 1 to 3, including coding and/or non-coding regions 

where appropriate. 
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The disclosure of these sequences opens up for the first 
time the ability to manipulate the starch debranching 
activity in plants in a number of important respects. 
These include, inter alia, the ability to: 

5 

a) Reduce the activity of each of the isoamylases in 
potato tiober and other plants in which homologous enzymes 
are expressed. 

b) Increase debranching enzyme activity in the potato 
10 tuber, by high level expression of one or more of each of 

the complete or partial potato cDNAs or sequences based 
thereon . 

c) Alter the activity of each of the isoamylases in 
various different subcellular compartments (e.g. plastids 

15 or cytosol) or at various different developmental stages. 

d) Study the effect of transformation experiments on 
the activities of isoforms of the debranching enzymes and 
related enzymes of starch synthesis and degradation, on 
the rates of starch synthesis, on starch structure, on 

20 ' the accumulation of soluble carbohydrates, and on the 
degradation of starch during sprouting. 

e) . Produce novel starch types in transgenic lines. 

f ) Produce novel isoamylases having modified activity. 

g) Isolate corresponding isoamylases. 

25 

In essence the various nucleic acid molecules of the 
present invention may ultimately be used to promote or 
alter (in respect of the reaction catalysed) the nature 
of the starch debranching activity in a particular cell 
30 or organism.* In some embodiments they may be used to 

repress starch debranching activity compared with that 
expressed in the xintrans formed cell or organism e.g. 
delay, retard, inhibit or slow down such activity. 

35 In particular, alteration of debranching enzyme activity 

starch- synthesising cells would modify the structure of 
the starch accumulated in those cells in novel ways. The 
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modifications to the starch which may be achieved using 
the nucleic acid molecules of the present invention 
include : 

5 1 . Decrease the degree of branching of amylopectin 

thereby creating a starch that may swell less or form a 
stronger gel on heating in water. This may be achieved by 
increasing the activity of starch debranching enzymes, 
preferably through the over-expression of one, and most 
10 preferably, more than one, endogenous or exogenous 
debranching enzymes. 

2. Increase the degree of branching of starch thereby 
opening up the possibility of increasing its swelling 

15 properties and its ability to form a paste rather than a 

gel when heated in. water. Particularly embraced is the 
production of phytoglycogen instead of some or all of the 
normal starch. This may be achieved by decreasing 
activity through the expression of antisense RNA. 

20 ' . 

3. Changing the branching pattern of amylopectin in 
other ways, thereby altering the physical properties of 
the starch. This may be accomplished by changing the 
isoform composition of debranching enzymes in a given 

25 tissue. Thus it may be achieved by selective decreases or 
increases in activity or subcellular localisation of 
endogenous isoaraylases or the introduction of novel 
isoamylases and/or mutants, variants, derivatives or 
alleles thereof. 

30 

These and other aspects of the present invention will now 
be described in more detail. 

Thus in a second aspect of the present invention there is 
35 disclosed a nucleic acid molecule encoding a mutant, 

variant, derivative or allele of a molecule of the first 
aspect, preferably to Seq ID Nos 1 to 3 . Preferred 
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mutants, variants, derivatives and alleles are those 
which are homologous to the respective Seq ID No and 
which also encode a product which has the ability to 
promote starch debranching activity. Mutants, variants or 
5 derivatives of the complement of Seq ID Nos 1 to 3 are 

those which have the ability to repress starch 
debranching activity. 

Methods for producing or identifying such a mutant, 
10 variant, derivative or allele (or other homologue) and 
assessing homology and function will now be discussed. 

Changes to a sequence, to produce a mutant, variant or 
derivative, may be by way of one or more of addition, 

15 insertion, deletion or substitution of one or more 

nucleotides in the nucleic acid, leading to the addition, 
insertion, deletion or substitution of one or more amino 
acids in the encoded polypeptide. Of course, changes to 
the nucleic acid which make no difference to the encoded 

20 amino acid sequence (i.e. ^degeneratively equivalent') 

are included. 

As is well-understood, homology at the amino acid level 
(i.e the encoded product of the nucleic acid molecule 

25 when expressed properly in frame) is generally in terms 
of amino acid similarity or identity. Similarity allows 
for "conservative variation", i.e. substitution of one 
hydrophobic residue such as isoleucine, valine, leucine 
or methionine for another, or the substitution of one 

30 polar residue for another, such as arginine for lysine, 

glutamic for aspartic acid, or glutamine for asparagine. 
Similarity may be as defined and determined by the 
TBLASTN program, of Altschul et al. (1990) J. Mbl . Biol, 
215: 403-10, which is in standard use in the art, or, and . 

35 this may be preferred, the standard program BestPit, 

which is part of the Wisconsin Package, Version 8, 
September 1994, (Genetics Computer Group, 575 Science 
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Drive, Madison, Wisconsin, USA, Wisconsin 53711) . 
BestFit makes an optimal alignment of the best segment of 
similarity between two sequences. Optimal alignments are 
found by inserting gaps to maximize the number of matches 
using the local homology algorithm of Smith and Waterman 

As is well known to those skilled in the art, altering 
the primary structure of a peptide by a conservative 
substitution may not significantly alter the activity of 
that peptide because the side -chain of the amino acid 
which is inserted into the sequence may be able to form 
similar bonds and contacts. as the side chain of the amino 
acid which has been substituted out. This is so even when 
the siibstitution is in a region which is critical in 
determining the peptides conformation. Indeed, such 
changes may confer slightly advantageous properties on 
the peptide. 

Also included are nucleic acids having a few 
'^non-conservative" substitutions. As is well known to 
those skilled in the art, substitutions to regions of a 
peptide which are not critical in determining its 
conformation may not greatly affect its activity because 
they do not greatly alter the peptide's three dimensional 
structure. Those in important regions (e.g. conserved 
regions I to IV) may confer advantageous properties on 
the polypeptide product. Similarly it may be desirable 
to alter or otherwise manipulate the transit peptide 
sequence e.g. in clones 21 and 15, in order to alter the 
targeting or localisation properties of the enzymes. 

A mutant, variant or derivative amino acid sequence in 
accordance with the present invention may include within 
the amino acid sequence encoded by Seq ID Nos 1 to 3 (see 
Pigs 4 to 6) a single amino acid change with respect to 
the sequence shown or 2, 3, 4, 5, 6, 7, 8, or 9 changes, 
about 10, 15, 20, 30, 40 or 50 changes, or greater than 
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about 50, 60, 70, 80 or 90 changes. 

In addition to one or more changes within the coding 
sequences of any one of Seq ID Nos 1 to 3, a mutant, 
5 variant or derivative nucleic acid molecule may have 

additional nucleotides at the 5' or 3 ' terminii . In 
particular it may be desirable to have a full length 
clone e.g. including any coding or non- coding regions 
(e.g. promoter) not included in the sequences but present 
10 in nature. These regions can be identified using methods 
analogous to those used to clone homologues or alleles as 
set out below. 

In a third, aspect of the present invention there is 
15 provided a method of identifying, mapping and/or cloning 
homologues or alleles from a plant species (including . 
potato) which method employs all or part of the 
nucleotide sequence of Seq ID Nos 1 to 3 . Suitable 
methods based on the sequences provided by the present 
20 invention are discussed below. If a portion of this a 
sequence is used this will be of sufficient length to 
identify homologues or alleles as described below. 

Optionally, if a portion of nucleotide sequence is used, 
25 then this portion will not itself be identical to cuiy 
part of Seq ID No 7 (Fig 7) which was used to detect 
clones 21, 15 and 9. Such a probe may therefore detect 
homologues and/or alleles which would not be detected 
using that Seq ID No 7. 

30 

In one embodiment of the third aspect, the nucleotide 
sequence of any one of Seq ID Nos 1 to 3, or any part 
thereof, may be used in a data-base search to find 
homologous sequences, expression products of which can be 
35 tested for ability to influence starch debr smching , 
particularly for isoamylase activity. This may be 
achieved, for instance, using the vectors of the present 
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invention discussed hereinafter. 

In a further embodiment of the third aspect, an 
isoamylase allele or homologue in accordance with the 
present invention is also obtainable by means of a method 
which includes providing a preparation of nucleic acid, 
e.g. from cells from a starch accumulating organ or 
tissue of a plant, providing a nucleic acid molecule 
having a nucleotide sequence shown in or complementary to 
a nucleotide sequence shown in any one of Seq ID Nos l to 
3, preferably from within the coding sequence, contacting 
nucleic acid in said preparation with said nucleic acid 
molecule under conditions for hybridisation of said 
nucleic acid molecule to any said gene or homologue in 
said preparation, and identifying said gene or homologue 
if present by its hybridisation with said nucleic acid 
molecule . 

Thus probing may employ the standard Southern blotting 
technique. For instance DNA may be extracted from cells 
and digested with different restriction enzymes. 
Restriction fragments may then be separated by 
electrophoresis on an agarose gel, before denaturation 
and transfer to a nitrocellulose filter. Labelled probe 
may be hybridised to the DNA fragments on the filter and 
binding determined. DNA for probing may be prepared from 
RNA preparations from cells. 

Test nucleic acid may be provided from a cell as genomic 
DNA, cDNA or RNA, or a mixture of any of these, 
preferably as a library in a suitable vector. The 
information derived using genomic DNA may also be used in 
mapping, and in identifying associated non-expressed 
elements e.g. promoters. 

Binding of a probe to target nucleic acid (e.g. DNA) may 
be measured using any of a variety of techniques at the 



wo 99/06575 



PCT/GB98A)2280 



15 

disposal of those skilled in the art. For instance, 
probes may be radioactively, f luorescently or 
enzymatically labelled. Other methods not employing 
labelling of probe include examination of restriction 
5 fragment length polymorphisms, amplification using PGR 

(see below), RN'ase cleavage and allele specific 
oligonucleotide, probing. 

Preliminary experiments may be performed by hybridising 
10 under low stringency conditions. For probing, preferred 

conditions are those which are stringent enough for there 
to be a simple pattern with a small number of 
hybridisations identified as positive which can be 
investigated further. It is well known in the art to 
15 increase stringency of hybridisation gradually until only 

a few positive clones remain. Suitable conditions would 
be achieved when a large number of hybridising fragments 
were obtained while the background hybridisation was low. 
Using these conditions nucleic acid libraries, e.g. cDNA 
20 libraries representative of expressed sequences, may be 

searched . 

Those skilled in the art are well able to employ suitable 
conditions of the desired stringency for selective 

.25 hybridisation, taking into account factors such as 

oligonucleotide length and base composition, temperature 
and so on. For instance, screening may initially be 
carried out under conditions, which comprise a 
temperature of about 37° C or less, a formamide 

30 concentration of less than about 50%, and a moderate to 

low salt (e.g. Standard Saline Citrate ('BSC') = 0.15 M 
sodium chloride; 0.15 M sodium citrate; pH 7) 
concentration . 

35 Alternatively, a temperature of about 50°C or less and a 
high salt (e.g. *SSPE'= 0.180 mM sodium chloride; 9 mM 
disodium hydrogen phosphate; 9 mM sodium dihydrogen 
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phosphate; 1 mM sodium EDTA; pH 7.4) . Preferably the 
screening is carried out at about 37^C, a formamide 
concentration of about 20%, and a salt concentration of 
about 5 X SSC, or a temperature of about 50**C and a salt 
5 . concentration of about 2 X SSPE. These conditions will 
allow the identification of sequences which have a 
substantial degree of homology (similarity, identity) 
with the probe sequence, without requiring the perfect 
homology for the identification of a stable hybrid. 

10 

Suitable conditions include, e.g. for detection of 
sequences that are about 80-90% identical, hybridization 
overnight at 42®C in 0.25M Na2HP04, pH 7.2, 6.5% SDS, 10% 
dextran sulfate and a final wash at 55°C in O.IX SSC, 
15 0.1% SDS. For detection of sequences that are greater 

than about 90% identical, suitable conditions include 
hybridization overnight at 65®C in 0.25M NasHPO^, pH 7.2, 
6.5% SDS, 10% dextran sulfate and a final wash at 60®C in 
O.IX SSC, 0.1% SDS. 

20 

Hybridisation is generally followed by identification of 
successful hybrids and then isolation of nucleic acid 
which has hybridised, which may involve one or more steps 
of PGR (see below) . 

25 

Thus one part of the present invention is a probe for use 
in this method. 

In a further embodiment of this aspect of the present 
30 invention, hybridisation of a nucleic acid molecule to an 

allele or homologue may be determined or identified 
indirectly, e.g using a nucleic acid amplification 
reaction, particularly the polymerase chain reaction 
(PGR) . PGR requires the use of two primers to 
35 specifically amplify target nucleic acid, so preferably 
two nucleic acid molecules with sequences characteristic 
of any one of Seq ID Nos 1 to 3 are employed. However, if 
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RACE is used (see below) only one such specific primer 
may be needed. Characteristic in this sense is preferably 
in the sense of distinguishing them from known probes or 
sequences e.g. those associated with the sul gene. 

PGR techniques for the amplification of nucleic acid are 
described in US Patent No. 4,683,195 and Saiki et al . 
Science 239: 487-491 (1988). PGR includes steps of 
denaturation of template nucleic acid (if doxable- 
stranded) , annealing of primer to target, and 
polymerisation. " The nucleic acid probed or used as 
template in the amplification reaction may be genomic 
DNA, cDNA or RNA. PGR may be used to amplify specific 
sequences from genomic DNA, specific RNA sequences and 
cDNA transcribed from mRNA. References for the general 
use of PGR techniques include Mullis et al. Gold Spring 
Harbor Symp. Quant. Biol., 51:263, (1987), Ehrlich (ed) , 
PGR technology, Stockton Press, NY, 1989, Ehrlich et al. 
Science, 252:1643-1650, (1991), "PGR protocols; A Guide 
to Methods and Applications", Eds. Innis et al. Academic 
Press, New York, (1990) . 

Prior to any PGR that is to be perforined, the complexity 
of a nucleic acid sample may be reduced where appropriate 
by creating a cDNA library for example using RT-PCR or by 
using the phenol emulsion reassociation technique (Clarke 
et al. (1992) NAR 20, 1289-1292) on a genomic library. 

Thus a method involving use of PGR in obtaining nucleic 
acid according to the present invention may include 
providing a preparation of plant nucleic acid, providing 
a pair of nucleic acid molecule primers useful in (i.e. 
suitable for) PGR, at least one of said primers having a 
sequence shown in or complementary to all or part of a 
sequence shown in any one of Seq ID NOs 1 to 3, 
contacting nucleic acid in said preparation with said 
primers iinder conditions for performance of PGR, 
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performing PGR and determining the presence or absence of 
an amplified PGR product. The presence of an amplified 
PGR product may indicate identification of a gene of 
interest or fragment thereof. 

5 • 

Thus the methods of the invention may include 
hybridisation of one or more (e.g. two) probes or primers 
to target nucleic acid. Where the nucleic acid is 
doiible- stranded DNA, hybridisation, will generally be 

10 preceded by denaturation to produce single -stranded DNA. 
The hybridisation may be as part of a PGR procedure, or 
as part of a probing procedure not involving PGR . An 
example procedure would be a combination of PGR and low 
stringency hybridisation. A screening procedure, chosen 

15 from the many available to those skilled in the art, is 

used to identify successful hybridisation events and 
isolated hybridised nucleic acid. 

The primers for use in these methods form one part of the 
20 present invention. 

In any case, an oligonucleotide for use in probing or 
nucleic acid amplification may have about 10 or fewer 
codons (e.g. G, 7 or 8) , i.e. be about 30 or fewer 

25 nucleotides in length (e.g. IB, 21 or 24) . Generally 

specific primers are upwards of 14 nucleotides in length. 
For optimum specificity, primers of 16-24 nucleotides in 
length may be preferred. Those skilled in the art are 
well versed in the design of primers for use processes 

30 such as PGR. 

In all cases the nucleic acids of the second aspect, or 
identified using the third aspect, share homology with 
those of the first aspect. Homology may be at the 
35 nucleotide sequence and/or amino acid sequence level. 

Preferably, there is at least about 83% homology, most 
preferably at least about 85%, 90%, 95%, 95%, 97%, 98%, 
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99% or 99.5% homology. 

Homology may be over the full-length of the relevant 
sequence shown herein, or may more preferably be over a 
contiguous sequence of about or greater than about 20, 
25, 30, 33, 40, 50, 67, 133 or more amino acids or 
codons, compared with the relevant amino acid sequence or 
nucleotide sequence as the case may be. 

Similarly the mutant, variant, derivative or allele (or 
other homologue) in accordance with the present invention 
will promote, alter or repress the starch debranching 
activity of a cell into which it is introduced. 

One possible mode of analysis of this activity is by 
transformation to assess function on introduction into a 
plant, plant cell or other cell of interest; methodology 
for such transformation is described in more detail 
below. 

The nucleic acid of the present invention, which may 
contain for example DNA corresponding to any one of Seq 
ID Nos 1 to 3, may be in the form of a recombinant and 
preferably replicable vector. 

Such vectors form a fourth aspect of the present 
invention. 

DNA vector is defined to include, inter alia, any 
plasmid, cosmid, phage or Agrobacterium binary vector in 
double or single stranded linear or circular form which 
may or may not be self' transmissible or mobilizaJDle . Can 
transform prokaryotic or eukaryotic host either by 
integration into the cellular genome or exist 
extrachroraosomally (e.g. autonomous replicating plasmid 
with an origin of replication) . Vectors may be introduced 
into hosts by any appropriate method e.g. conjugation. 
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mobilisation, transformation, transfection, transduction 
or elect oporat ion. Also included are shuttle vectors by 
which is meant a DNA vehicle capable, naturally or by 
design, of replication in both the actinomycetes and 
5 related species and in bacteria and/or eucaryotic (e.g. 
higher plant, mammalian, yeast or fungal cells) . 

A vector including nucleic acid according to the present 
invention need not include a promoter or other regulatory 
10 • sequence, particularly if the vector is to be used to 

introduce the nucleic acid into cells for recombination 
into the genome. 

However, in a preferred embodiment of the fourth aspect 

15 the vector is an expression vector. Those skilled in the 

art are well able to construct vectors and design 
protocols for recombinant gene expression. Suitable 
vectors can be chosen or constructed, containing 
appropriate regulatory sequences, including promoter 

20 sequences, terminator fragments, polyadenylation 

sequences, enhancer sequences, marker genes amd other 
sequences as appropriate. For further details see, for 
example. Molecular Cloning: a Laboratory Manual: 2nd 
edition,. Sambrook et al, 1989, Cold Spring Harbor 

25 Laboratory Press. Many known techniques and protocols 
for manipulation of nucleic acid, for example in 
preparation of nucleic acid constructs, mutagenesis, 
sequencing, introduction of DNA into cells and gene 
expression, and analysis of proteins, are described in 

30 detail in Current Protocols in Molecular Biology, Second 

Edition, Aus\ibel et al . eds., John Wiley & Sons, 1992. 
The disclosures of Sambrook et al. and Austibel et al. are 
incorporated herein by reference. Specific procedures 
and vectors previously used with wide success upon plants 

35 are described by Bevan (Nucl. Acids Res. 12, 8711-8721 

(1984)) and Guerineau and Mullineaux (1993) (Plant 
transformation and expression vectors. In: Plant 
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Molecular Biology Labfax (Croy RRD ed) Oxford, BIOS 
Scientific Publishers, pp 121-148) . 

Selectable genetic markers may be used consisting of 
chimaeric genes that confer selectable phenotypes such as 
resistance to antibiotics such as kanamycin, hygromycin, 
phosphinotr icin , chlorsulf uron , methotrexate , gentamycin , 
spectinomycin, imidazolinones and glyphosate. 

Thus nucleic acid molecules of the present invention may 
be under the control of coi appropriate promoter or other 
regulatory elements for expression in a host cell such as 
a microbial, e.g. bacterial, or plant cell. In the case 
of genomic DNA, this may contain its own promoter or 
other regulatory elements and in the case of cDNA this 
may be under the control of an appropriate promoter or 
other regulatory elements for expression in the host 
cell . 

Thus one aspect the present invention provides a gene 
construct, preferably a replicable vector, comprising a 
promoter operatively linked to a nucleotide sequence 
provided by the present invention, e.g. any one of Seq ID 
Nos 1 to 3, the complement, or any mutant, variant or 
allele thereof. 

By "promoter" is meant a sequence of nucleotides from 
which transcription may be initiated of DNA operably 
linked downstream (i.e. generally in the 3' direction on 
the sense stramd of double -stranded DNA) . 

"OpersO^ly linked" means joined as part of the same 
nucleic acid molecule, suitaUDly positioned and oriented 
for transcription to be initiated from the promoter. 
DNA operably linked to a promoter is "under 
transcriptional initiation regulation" of the promoter. 
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The promoter may include one or more sequence motifs or 
elements conferring developmental and/or tissue-specific 
regulatory control of expression. Other regulatory 
sequences may be included, for instance as identified by 
5 mutation or digest assay in an appropriate expression 

system or by sequence comparison with available 
information, e.g. using a computer to search on-line 
databases. Sequences for intra- or intercellular 
targetting may also be included e.g. plastid targetting 
10 sequences as described in, or modified from. Stark et al 

(1992) Science 258: 287-292, Also included may be 
appropriate untranscribed regions e.g. which cause the 
addition of the polyadenylate nucleotides to 3' end of 
trainscribed RNA, 

15 

Suitable promoters may include the Cauliflower Mosaic 
Virus 35S (CaMV 35S) gene promoter that is expressed at a 
high level in virtually all plant tissues (Benfey et al, 
1990a and 1990b) . Other promoters may include the tuber 
20 specific B33 promoter (Rocha-^Sosa et al (1989) EMBO J 

8:23-29), or the patatin (class I) promoter. 

In one embodiment of the fourth aspect these is disclosed 
a gene construct, preferably a replicable vector, 
25 comprising an inducible promoter operatively linked to a 

nucleotide sequence provided by the present invention. 

The present invention also provides plants transformed 
with said gene construct and methods comprising 
30 introduction of such a construct into a plant cell and/or 

induction* of expression of a construct within a plant 
cell, by application of a suitable stimulus, an effective 
exogenous inducer. 

35 The term "inducible" as applied to a promoter is well 
understood by those skilled in the art. In essence, 
expression under the control of an inducible promoter is 



wo 99/06575 



PCT/GB98/02280 



23 

"switched on" or increased in response to an applied 
stimulus. The nature of the stimulus varies between 
promoters. Some inducible promoters cause little or 
undetectable levels of expression (or no expression) in 
the absence of the appropriate stimulus. Other inducible 
promoters cause detectable constitutive expression in the 
absence of the stimulus. Whatever the level of expression 
is in the absence of the stimulus, expression from any 
inducible promoter is increased in the presence of the 
correct stimulus. The preferable situation is where the 
level of expression increases upon application of the 
relevant stimulus by an amoxint effective to alter a 
phenotypic characteristic. Thus an inducible (or 
"switchable") promoter may be used which causes a basic 
level of expression in the absence of the stimulus which 
level is too low to bring about a desired phenotype (and 
may in fact be zero) . Upon application of the stimulus, 
expression is increased (or switched on) to a level which 
brings about the desired phenotype. 

A suitable inducible promoter is the GST-II-27 gene 
promoter which has been shown to be induced by certain 
chemical compounds which can be applied to growing 
plcuits. The promoter is functional in both 
monocotyledons and dicotyledons . It can therefore be 
used to control gene expression in a variety of 
genetically modified plants, including field crops such 
as canola, sunflower, tobacco, sugarbeet, cotton; cereals 
such as wheat, barley, rice, maize, sorghum; fruit such 
as tomatoes, mangoes, peaches, apples, pears, 
strawberries, bananas, and melons; and vegetables such as 
carrot, lettuce, cabbage and onion. The GST-II-27 
promoter is also suitcUDle for use in a variety of 
tissues, including roots, leaves, stems and reproductive 
tissues . 

The vectors of the fourth aspect of the invention may be 
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used, inter allst, to transform plants and plant cells 
thereby altering their properties in a number of 
important respects . 

Thus specific changes in activity of individual forms of 
isoamylase may be brought about by sense or antisense 
transformation. The disclosure by the present inventors 
of several forms of isoamylase has opened up the 
possibility of *fine tuning' their effect (s) on 
amylopectin structure and hence on the properties of 
starch, depending upon which isoform is changed, the 
degree to which the activity is increased or decreased, 
and the timing of this change in relation to the period 
of starch synthesis in the tuber. Changes in activity of 
more than one isoform simultaneously can likewise be used 
to produce unique effects on amylopectin structure and 
hence on the properties of starch. Similarly the 
introduction via transformation of one or more of the 
isoamylases from the potato tiiber into starch- 
synthesising organs of other species may be used to bring 
about unicjue and novel changes in the structure of 
amylopectin and hence in the properties of starch in 
those organs. 

Thus in a fifth aspect of the present invention there is 
disclosed a host cell containing nucleic acid or a vector 
according to the present invention, especially a plant or 
a microbial cell. 

This aspect of the present invention further encompasses 
a host cell transformed with nucleic acid or a vector 
according to the present invention, especially a plant or 
a microbial cell. Within the cell, the nucleic acid may 
be incorporated within the chromosome. There may be more 
than one such heterologous nucleotide sequence per 
haploid genome. 
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When introducing a chosen gene construct into a cell, 
certain considerations must be taken into account, well 
known to those skilled in the art. The nucleic acid to be 
inserted should be assembled within a construct which 
contains effective regulatory elements which will drive 
transcription. There must be available a method of 
transporting the construct into the cell. Once the 
construct is within the cell membrane, integration into 
the endogenous chromosomal material either will or will 
not occur. Finally, as far as plants are concerned the 
target cell type must be such that cells can be 
regenerated into whole plants (see below) . 

Plants transformed with the DNA segment containing the 
sequence may be produced by standard techniques which are 
already known for the genetic manipulation of plants. DNA 
can be transformed into plant cells using any suitable 
technology, such as a disarmed Ti-plasmid vector carried 
by Agrobacterium exploiting its natural gene transfer 
ability (EP-A-270355, EP-A-0116718 ^ NAR 12(22) 8711 - 
87215 1984) , particle or microprojectile bombardment (US 
5100792, EP-A-444882, EP-A-434616) microinjection (WO 
92/09696, WO 94/00583, EP 331083, EP 175966, Green et al . 
(1987) Plant Tissue and Cell Culture, Academic Press) , 
electroporation (EP 290395, WO 8706614 Gelvin Debeyser - 
see attached) other forms of direct DNA uptake (DE 
4005152, WO 9012096, US 4684611), liposome mediated DNA 
uptake (e.g. Freeman et al. Plant Cell Physiol. 29: 1353 * 
(1984)), or the vortexing method (e.g. Kindle, PNAS 
U.S.A. 87: 1228 (1990d). Physical methods for the 
transformation of plant cells are reviewed in Oard, 1991, 
Biotech. Adv. 9: 1-11. 

Agrobacterium transformation is widely used by those 
skilled in the art to transform dicotyledonous species. 
Recently, there has been substantial progress towards the 
routine production of stable, fertile transgenic plants 
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in almost all economically relevant monocot plants 
(Toriyama, et al. (1988) Bio/Technology G, 1072-1074; 
Zhang, et al. (1988) Plant Cell Rep. 1, 379-384; Zhang, 
et al. (1988) Theor Appl Genet 16, 835-840; Shimamoto, et 
5 ai. (1989) Nature 338, 274-276; Datta, et al. (1990) 

Bio/Technology 8, 736-740; Christou, et al. (1991) 
Bio/Technology 9, 957-962; Peng, et al. (1991) 
International Rice Research Institute, Manila, 
Philippines 563-574; Cao, et al. (1992) Plant Cell Rep. 

10 11, 585-591; Li, et al. (1993) Plant Cell Rep. 12, 250- 

255; Rathore, et al. (1993) Plant Molecular Biology 21, 
871-884; Fromm, et al. (1990) Bio/Technology 8, 833-839; 
Gordon-Kamm, et al. (1990) Plant Cell 2, 603-618; 
D'Halluin, et al. (1992) Plant Cell 4, 1495-1505; 

15 Walters, et al. (1992) Plant Molecular Biology 18, 189- 

200; Koziel, et al. (1993) Biotechnology 11, 194-200; 
Vasil, I. K. (1994) Plant Molecular Biology 25, 925-937; 
Weeks, et al. (1993) Plant Physiology 102, 1077-1084; 
Somers, et al. (1992) Bio/Technology 10, 1589-1594; 

20 W092/14B28) . In particular, AgroJbacterium mediated 
trcuisformation is now emerging also as an highly 
efficient alternative transformation method in monocots 
(Hiei et al. (1994) The Plant Journal 6, 271-282). 

25 The generation of fertile transgenic plants has been 

achieved in the cereals rice, maize, wheat, oat, and 
barley (reviewed in Shimamoto, K. (1994) Current Opinion 
in Biotechnology 5, 158-162.; Vasil, et al. (1992) 
Bio/Technology 10, 667-674; Vain et al., 1995, 

30 Biotechnology Advances 13 (4) : 653-671; Vasil, 1996, 
Natxire Biotechnology 14 page 702) . 

Microprojectile bombardment, electroporation and direct 
DNA uptake are preferred where Agrobacterium is 
35 inefficient or ineffective. Alternatively, a combination 

of different techniques may be employed to enhance the 
efficiency of the transformation process, eg bombardment 
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with Agrobacterium coated microparticles {EP-A-486234) or 
microprojectile bombardment to induce wounding followed 
by co-cultivation with Agrobacterium (EP-A-486233) . 

Following transformation, a plant may be regenerated, 
e.g. from single cells, callus tissue or leaf discs, as 
is standard in the art. Almost any plant can be entirely 
regenerated from cells, tissues and organs of the plant. 
Available techniques are reviewed in Vasil et al . , Cell 
Culture and SoToatic Cell Genetics of Plants, Vol I, II 
and III, Laboratory Procedures and Their Applications, 
Academic Press, 1984, and Weissbach and Weissbach, 
Methods for Plant Molecular Biology, Academic Press, 
1989. 

The particular choice of a transformation technology will 
be determined by its efficiency to transform certain 
plant species as well as the experience and preference of 
the person practising the invention with a particular 
methodology of choice. It will be apparent to the skilled 
person that the particular choice of a transformation 
system to introduce nucleic acid into plant cells is not 
essential to or a limitation of. the invention, nor is the 
choice of technique for plant regeneration. 

Thus a sixth aspect of the present invention provides a 
method of generating a cell involving introduction of a 
vector as described in relation to the fourth aspect 
above into plant cell. and causing or allowing 
recombination between the vector and the cell genome to 
introduce the sequence of nucleotides into the genome. 
Preferably the cell, is a plant cell. 

Thus according to the invention there is provided a plant 
cell having incorporated into its genome nucleic acid, 
particularly heterologous nucleic acid, as provided by 
the present invention, under operative control of a 
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regulatory sequence for control of expression. The 
coding sequence may be operably linked to one or more 
regulatory sequences which may be heterologous or foreign 
to the gene i.e. which is not naturally associated with 
the gene for its expression. The nucleic acid according 
to the invention may be placed under the control of an 
externally inducible gene promoter to place expression 
under the control of the user. 

The term "heterologous" may be used to indicate that the 
gene/sequence of nucleotides in question have been 
introduced into said cells of the plant or an ancestor 
thereof, using genetic engineering, i.e. by human 
intervention. A transgenic plant cell, i.e. transgenic 
for the nucleic acid in question, may be provided. The 
transgene may be on an extra -genomic vector or 
incorporated, preferably stably, into the genome. A 
heterologous gene may replace an endogenous equivalent 
gene, ie one which normally performs the same or a 
similar function, or the inserted sequence may be 
additional to the endogenous gene or other sequence. An 
advantage of introduction of a heterologous gene is the 
cibility to place expression of a sequence under the 
control of a promoter of choice, in order to be able to 
influence expression according to preference . 
Furthermore, mutants, variants and derivatives of the 
wild- type gene, e.g. with higher or lower activity than 
\^ld-type, may be used in place of the endogenous gene. 
Nucleic acid heterologous, or exogenous or foreign, to a 
plant cell may be non-naturally occurring in cells of 
that type, variety or species. Thus, nucleic acid may 
include a coding sequence of or derived from a particular 
type of plant cell or species or variety of plant, placed 
within the context of a plant cell of a different type or 
species or variety of plant. A further possibility is 
for a nucleic acid sequence to. be placed within a cell in 
which it or a homologue is found naturally, but wherein 
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the nucleic acid sequence is linked and/or adjacent to 
nucleic acid which does not occur naturally within the 
cell, or cells of that type or species or variety of 
plant, such as operably linked to one or more regulatory 
5 sequences, such as a promoter sequence, for control of 

expression. A sequence within a plant or other host cell 
may be identifiably heterologous, exogenous or foreign. 

A plant may be regenerated from one or more transformed 
10 plant cells. Thus a plant including a plant cell 

according to the invention forms a seventh aspect of the 
present invention, along with any part or propagule 
thereof, seed, selfed or hybrid progeny and descendants ^ 

15 A plant according to the present invention may be one 

which does not breed true in one or more properties, 
Plant varieties may be excluded, particularly registrable 
plant varieties according to Plant Breeders' Rights. It 
is noted that a plant need not be considered a '^plant 

20 variety" simply because it contains stably within its 

genome a transgene, introduced into a cell of the plant 
or an ancestor thereof. 

Preferred plants of the present invention include 
25 modified potato, pea, maize, wheat, cassava, rice and 
barley. 

In addition to a plant, the present invention provides 
any clone of such a plant, seed, selfed or hybrid progeny 

30 and descendants, and any part of any of these, such as 

cuttings, seed. The invention provides any plant 
propagule, that is any part which may be used in 
reproduction or propagation, sexual or asexual, including 
cuttings, seed and so on. Also encompassed by the 

35 invention is a plant which is a sexually or asexually 

propagated off -spring, clone or descendant of such a 
plant, or any part or propagule of said plant, off- 
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spring, clone or descendant. 

As discussed above, particularly embraced by the present 
invention are methods of influencing or affecting the 
5. starch debranching activities of a plant comprising the 
use of any of the nucleic acids, vectors and/or other 
materials or methods discussed in relation to aspects one 
to seven above, including causing or allowing expression 
of a heterologous nucleic acid sequence within cells of 
10 the plant. Such methods form an eighth aspect of the 
present invention. 

In one embodiment there is provided a method including 
expression, of a nucleic acid molecule having a sequence 

15 identical or complementary to all or part of Seq ID Nos 1 

to 3, or a mutant, variant, allele or other derivative of 
the sequence, within cells of a plant (thereby producing 
the encoded polypeptide) , following an earlier step of 
introduction of the nucleic acid into a cell of the plant 

20 or an ancestor thereof. Such a method may be used to 

influence the starch generated within the cells of that 
plant . 

In the present invention, over-expression may be achieved 
25 by introduction of the nucleic acid molecules discussed 

above in a sense orientation. Thus, the present 
invention provides a method of influencing the starch 
debranching activity of a plant, the method including 
causing or allowing expression of the product 
30 (polypeptide or nucleic acid transcript) encoded by 

heterologous nucleic acid according to the invention from 
that nucleic acid within cells of the plant. 

Conversely, down-regulation of expression of a target 
35 gene (i.e. an isoaraylase encoded by any of the nucleic 

acid molecules of the present invention) may be achieved 
using anci-sense technology or "sense regulation" ("cc- 
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suppression") . 

In using anti-sense genes or partial gene sequences to 
down- regulate gene expression, a nucleotide sequence is 
5 placed under the control of a promoter in a "reverse 

orientation" such that transcription yields RNA which is 
complementary to normal mRNA transcribed from the "sense" 
strand of the target gene. See, for example, Rothstein 
et al, 1987; Smith et al, (1988) Nature 334, 724-726; 
10 Zhang et al, (1992) The Plant Cell 4, 1575-1588, English 

et al., (1996) The Plant Cell 8, 179-188. Antisense 
technology is also reviewed in Bourque, (1995) , Plant 
Science 105, 125-149, and Flavell, (1994) PNAS USA 91, 
3490-3496. 

15 

An alternative is to use a copy. of all or part of the 
target gene inserted in sense, that is the same, 
orientation as the target gene, to achieve reduction in 
expression of the target gene by co- suppression. 

20 

Both of these methods will now be discussed in more 
detail . 

The complete sequence corresponding to the coding 
25. sequence of the targeted isoamylase (in reverse 

orientation for ant i- sense) need not be used. For 
example fragments of sufficient length may be used. It 
is a routine matter for the person skilled in the art to 
screen fragments of various sizes and from various parts 
30 of the coding sequence to optimise the level of anti- 

sense inhibition. It may be advantageous to include the 
initiating methionine ATG codon, and perhaps one or more 
nucleotides upstream of the initiating codon. A further 
possibility is to target a conserved sequence of a gene, 
35 e.g. a sequence that is characteristic of one or more 
genes, such as a regulatory sequence. 
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The sequence employed may be about 500 nucleotides. 
However it may be less, possibly about 400 nucleotides, 
about 300 nucleotides, about 200 nucleotides, or about 
100 nucleotides. It may be even be possible to use 
oligonucleotides of much shorter lengths, 14-23 
nucleotides. Longer fragments, for instance longer than 
about 500 nucleotides are preferable where possible, such 
as longer than about 600 nucleotides, than about 700 
nucleotides, than about 800 nucleotides, than about 1000 
nucleotides or more. 

It may be preferable that there is complete sequence 
identity in the sequence used for down- regulation of 
expression of a target sequence, and the target sequence, 
though total complementarity or similarity of sequence is 
not essential. One or more nucleotides may differ in the 
sequence used from the target gene. Thus, a sequence 
employed in a down- regulation of gene expression in 
accordance with the present, invention may be a wild- type 
sequence (e.g. gene) selected from those available, or a 
mutant, derivative, variant or allele, by way of 
insertion, addition, deletion or substitution of one or 
more nucleotides, of such a sequence. The sequence need 
not include an open reading frame or specify an RNA that 
would be translatable. It may be preferred for there to 
be sufficient homology for the respective anti-sense and 
sense RNA molecules to hybridise. There may be down 
regulation of gene expression even where there is about 
5%, 10%, 15% or 20% or more mismatch between the sequence 
used and the target gene, although it may be advantageous 
to have minimal mismatch. 

Thus generally speaking, the transcribed nucleic acid may 
represent a fragment of an isoamylase gene, such as any 
one of those corresponding to Seq ID Nos 1 to 3, or the 
complement thereof, or may be a mutant:, derivative, 
variant or allele thereof, in similar terms as discussed 
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above in relation to alterations being made to an coding 
sequence and the homology of the altered sequence (see 
the first or second aspects of the invention) . The 
homology may be sufficient for. the transcribed anti-sense 
5 RNA to hybridise with nucleic acid within cells of the 

plant, though irrespective of whether hybridisation takes 
place the desired effect is down -regulation of gene 
expression. 

10 Anti-sense regulation may itself be regulated by 
employing an inducible promoter in an appropriate 
construct. Thus, the present invention also provides a 
method of influencing a starch debranching activity of a 
plant, the method including causing or allowing 

15 ant i- sense transcription from heterologous nucleic acid 

according to the invention within cells of the plant. 

The anti-sense constructs {nucleic acids) themselves are 
also embraced by the present invention, as is use of 
20 these constructs for down-regulation of gene expression, 

particularly down -regulation of expression of an 
isoatnylase or homologue thereof, preferably in order to 
influence the starch debranching enzyme activity of a 
plant, especially a crop plant. 

25 

As discussed cJDOve, when additional copies of the target 
gene are inserted in sense, that is the same, orientation 
as the target gene, a range of phenotypes is produced 
which includes individuals where over-expression occurs 

30 and some where under-expression of protein from the 

target gene occurs. When the inserted gene is only part 
of the endogenous gene the number of under -expressing 
individuals in the transgenic population increases. The 
mechanism by which sense regulation occurs, particularly 

35 down -regulation, is not well-understood. However, this 

technique is well -reported in scientific and patent 
literature and is used routinely for gene control. See, 



wo 99/06575 



PCT/GB98AI2280 



34 

for example, van der Krol et al., (1990) The Plant Cell 
2, 291-229; Napoli et al., (1990) The Plant Cell 2, 
279-289; Zhang et al, 1992 The Plant Cell 4, 1575-1588, 
and US-A-5,231,020 

5 

Again, anti-sense fragments, mutants and so on may be 
used in similar terms as described above in relation to 
the second aspect . 

10 Further methods of down- regulating activity include 

inhibition by expressing dominant negative versions (i.e. 
mutcuit or truncated versions) of the isoamylases which 
will inhibit endogenous, wild- type enzymes in a 
competitive or non-competitive way e.g. by competing for 

15 binding sites on the starch granules, or by association 

to form non- functional multimers. Alternatively one can 
use ribozymes, e.g. hammerhead ribozymes, which can 
catalyse the site-specific cleavage of RNA, such as mRNA 
(see e.g. Jaeger (1997) "The new world of ribozymes" Curr 

20 Opin Struct Biol 7:324-335, or Gibson & Shillitoe 

(1997) "Ribozymes: their functions and strategies form 
their use*" Mol Biotechnol 7: 242-251.) 

Thus, the present invention also provides a method of 
25 influencing a starch debranching activity of a plant, the 
method including causing or allowing transcription of 
nucleic acid as described above, within. cells of che 
plant . 

30 Here the starch debranching activity of the product is 

preferably suppressed as a result of under-expression of 
isoamylase within the plant cells. 

In a ninth aspect of the invention there is disclosed the 
35 expression product (preferably being an isoamylase) of 
any of the nucleic acid secjuences disclosed above, 
particularly those of the first and second aspects of the 
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invention, optionally by means of the vectors of the 
fourth aspect . Example amino acid sequences are given in 
Figs 4 to 6 . Also embraced are methods of generating 
isoamylases by expression from encoding nucleic acid 
5 therefore under suitable conditions, which may be in 

suitable host cells. Following expression, the product 
may be isolated from the expression system and may be 
used as desired, for instance in formulation of a 
composition including at least one additional component. 

10 

One particular use for such expression products may be 
raising antibodies. Such antibodies form a tenth aspect 
of the present invention. 

15 Thus purified protein of the ninth aspect, or a fragment, 

mutant, derivative or variant thereof, e.g. produced 
recombinantly by expression from encoding nucleic acid, 
may be used to raise antibodies employing techniques 
which are standard in the art. Antibodies and 

20 polypeptides comprising antigen -binding fragments of 
antibodies may be used in identifying homologues from 
other species as discussed further below. 

Methods of producing antibodies include immunising a 
25 mammal (e.g. human, mouse, rat, rcibbit, horse, goat, 

sheep or monkey) with the protein or a fragment thereof . 
Antibodies may be obtained from immunised animals using 
any of a variety of techniques known in the art, and 
might be screened, preferably using binding of antibody 
30 to antigen of interest. For instance. Western blotting 

techniques or immunoprecipitation may be used (Armitage 
et al, 1992, Natxire 357: 80-82) . Antibodies may be 
polyclonal or monoclonal. 

35 As an alternative or supplement to immunising a mammal, 
antibodies with appropriate binding specificity may be 
obtained from a recombinantly produced library of 
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expressed immunoglobulin variable domains, e.g. using 
lambda bacteriophage or filamentous bacteriophage which 
display functional immunoglobulin binding domains on 
their surfaces; for instance see WO92/01047. 

5 

Antibodies raised to a polypeptide or peptide can be 
used, inter alia, in the identification and/or isolation 
and/or localisation (e.g. intracellular) of the peptides 
of the present invention and homologous polypeptides, and 

10 may also permit isolation of the encoding genes. Thus, 

the present invention provides a method of identifying or 
isolating a. polypeptide with starch debranching function 
(in accordance with embodiments disclosed herein) , 
comprising screening candidate polypeptides with a 

15 polypeptide comprising the antigen -binding domain of an 

antibody {for example whole antibody or a fragment 
thereof) which is able to bind an starch debranching 
polypeptide or fragment, variant or derivative thereof or 
preferably has binding specificity for such a 

20 polypeptide. Specific binding members such as antibodies 

and pplypeptides comprising antigen binding domains of 
antibodies that bind and are preferably specific for an 
isoaraylase or mutant, variant or derivative thereof 
represent further aspects of the present invention, as do 

25 their use and methods which employ them. 

Candidate polypeptides for screening may for instance be 
the products of an expression library created using 
nucleic acid derived from an plant of interest, or may be 

30 the product of a purification process from a natural 

source. A polypeptide foxind to bind the antibody may be 
isolated and then may be subject to amino, acid 
secjuencing. Any suitable technique may be used to 
sequence the polypeptide either wholly or partially (for 

35 instance a fragment of the polypeptide may be sequenced) 
Amino acid sequence information may be used in obtaining 
nucleic acid encoding the polypeptide, for instance by 
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designing one or more oligonucleotides (e.g. a degenerate 
pool of oligonucleotides) for use as probes or primers in 
hybridization to candidate nucleic acid, or by searching 
computer sequence databases, as discussed above. 

5 

An eleventh aspect of the present invention is a 
polysaccharide generated (in vivo or using an enzyme 
preparation) by a process comprising the use of an 
isoamylase of the ninth aspect. Also embraced is starch 

10 produced in the transformed plants and cells discussed 

above. Such starch is preferably derived from amylopectin 
but has any of a decreased, increased or otherwise 
altered degree of branching, with a corresponding 
alteration in properties e.g. swelling or aJDility to form 

15 a paste rather than a gel when heated in water. 

Commodities (e.g. foodstuffs) comprising such starches 
form a further aspect of the present invention . 

Other commodities which may benefit from the modified 
20 starches of the present invention include biodegradable 

plastics; food-processing thickeners; starch coated 
films, papers & textiles; paint thickeners; mining 
explosives; pharmaceuticals and glues. The modified 
starches can be used analagously to prior art starches in 
25 these materials, in ways which are well kiiown to those 

skilled in the respective technical fields. 

The invention will now be further illustrated with 
reference to the following non- limiting Figures and 
30 Examples. Other embodiments falling within the scope of 

the invention will occur to those skilled in the art in 
the light of these. 

FIGURES 

35 



Fig la and lb shows nucleotide Seq ID No 1 (from 
isoamylase clone 21) . 
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Fig 2a and 2b shows nucleotide Seq ID No 2 (from 
isoamylase clone 15) . 

Fig 3a and 3b shows nucleotide Seq ID No 3 (from 
isomylase clone 9) . 

Fig 4 shows amino acid Seq ID No 4 (from isoamylase clone 
21) . 



10 Fig 5 shows amino acid Seq ID No 5 (from isoamylase clone 

15) . 

Fig 6 shows amino acid Seq ID No 6 (from isomylase clone 
9) . 

15 

Fig 7 shows Seq ID No 7, corresponding to the Arabidopsis 
thaliana v. Columbia probe At69012 .new_est taken from the 
Medline Database, originally pxiblished by Newman et al 
(1994) Plant Physiol 106: 1241-1255. 

20 

Fig 8(a) shows the transit peptide sec[uence from clone 21 
(Seq ID No 8). Fig 8(b) shows the transit peptide 
sequence from clone 15 (Seq ID No 9) . 



25 Fig 9 ishows a dendogram which places the debranching 

enzymes in 2 distinct groups, with clones 15, 9 and 21 
all aligning with the isoamylases. 

Fig 10a and 10b shows nucleotide Seq ID No 10 (original 
30 sequence from isoamylase clone 21) . 

Fig 11a and lib shows nucleotide Seq ID No 11 (original 
sequence from isoamylase clone 15) . 

35 Fig 12 shows nucleotiide Seq ID No 12 (original sequence 

from isomylase clone 9) . 
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Fig 13 shows amino acid Seq ID No 13 (original sequence 
from isoamylase clone 21) . 

Fig 14 shows amino acid Seq ID No . 14 (original sequence 
5 from isoamylase clone 15) . *x' is unknown aminoacid. 

is a stop codon. 

Fig 15 shows amino acid Seq ID No 15 (original sequence 
from isomylase clone 9) , 

10 

Fig 16(a) shows the forward primer (Seq ID No 16) used to 
isolate the probe (see Examples below). Fig 16(b) shows 
^ the reverse primer (Seq ID No 17) . 

15 EXAMPLES 

Exampl e 1- cloning of the debranchina enzymes from potato 

Briefly, cDNA clones from potato were isolated from cDNA 
20 libraries synthesised from mRNA from both developing 

tubers and from in vitro grown minitubers. The probe was 
an EST from AraJbidopsis (At69012 .new_est) which was 
identified by the present inventors as showing 
significant homology to the sul gene from maize. 

25 

Libraries : 

Two independent libraries were prepared. These were - 

a) from mRNA from developing tubers from greenhouse 

30 cultivated potato plants ( Solanum tuberosum var desiree) 

b) from mRNA from minitubers induced on stem explants of 
potato ( Solanum tuberosum var desiree) cultured in vitro 
according to the method of Visser et ai (1994) Physiol 

35 Plantarum 90: 285-292. Minitubers were used in addition 

to tubers in order to assess starch synthesising organ 
which has a different gene expression profile to tubers. 
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The cDNA was synthesized by reverse transcription using 
as a template poly(A)RNA passed twice over 
oligo(dT) cellulose . The poly(A)RNA was reverse 
5 transcribed to form the first strand cDNA and the second 

strand was prepared using DNA polymerase I (large 
subunit) , T4 DNA polymerase and RNAase H as described in 
the manufacturer's instructions for the cDNA synthesis 
kit (Amersham pic, UK) . 

10 

cDNA was ligated to EcoRI adaptors as described in the 
Rapid adapter ligation kit (Amersham pic, UK) and then 
cloned into the EcoRI of the X cloning vector XgtlO 
according to the manufacturers instructions for the XgtlO 
15 cloning kit. (Amersham pic, UK) . 

Probe : 

The probe used was a fragment of an Arabidopsis EST (EMBL 
ID No. At69012 .new_est ; accession no. H36690) . This EST 
20 was identified initially using a BLAST search of EST 

databases . 

In order to determine the extent of the homology between 
the ArcUDidopsis EST and the Sul gene product, the EST was 

25 further sequenced using an Applied Biosystems Taq cycle 

sequencing kit (Perkin Elmer) and an ABI automated 
sequencer. This generated a further 780 bp sequence data. 
It showed significant homology to the deduced amino acid 
sequence for the product of the Sul gene from maize that 

3 0 encodes an isoamylase type of debranching enzyme. 

The probe was prepared using PGR amplification of a 
miniprep of the plasmid. The PGR used the M13 reverse 
primer: 

35 

5' -GAG GAA AGA GGT ATG AG -3' (SEq ID No 5) 
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And also a primer specific for the 3' end of the EST, at 
a point before the polyA tail. This was designated G3712: 

5' -GAT CAT AAC TTG AGT TCT AAG CGG. -3' 

5 

The amplified fragment was cut with PstI to remove, the 
sequences from the vector. The fragment was purified and 
then labelled using an oligonucleotide random priming 
IcdDelling kit to provide the probe. 

10 

Screening: 

Approximately 60,000 plaques from the tuber library 
(unamplif ied) and 60,000 plaques from the minituber 
librairy (unamplif ied) were used to infect E. coli (strain 
15 NM514) and the resultant plaques were screened using a 

1.2 kb fragment of EST cDNA clone (At69012 . new_est ) which 
lacked the poly (A) tail. Filters were subsequently washed 
at low stringency (2xSSC, 0.5%SDS, 55^C for two washes). 

20 16 independent phages (5 from tuber and 11 from 

minittiber) that showed different levels of hybridization 
to the EST probe were selected. 

Subclonina : 

25 

DNA from 9 independent clones was subcloned into either 
pCR2.1 (Invitrogen) or pBluescript (Stratagene) in E. 
colx. Those in pCR2 . 1 were subcloned following PGR 
amplification of the inserts using XgtlO specific 
30 oligonucleotides. Those in pBluescript were isolated as 

EcoRI fragments from XDNA preparations. 

Seoruencina & analysis : 

35 Clones were sequenced using the Taq cycle sequencing kit 

from Perkin Elmer and the ABI automated sequencer. To 
complete any incomplete sequence, primers based on the 
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known portions of the sequence are used to *walk' along 
the clones in the library to identify the reiuaining 
portions. Following initial sequencing of the C9 clone, 
a longer cDNA was obtained and sequenced. 

The predicted N- terminal amino acid sequences for C15 and 
C21 fit the criteria for plastid transit peptides. A 
summary of the cDNA clones is presented below. This 
refers to the original sequences. Corresponding 
comparisons with sul for the new sequences are shown 
above. 



Table 3 



cDNA 
clones 


length 
(kb) 


open reading 
frame (number of 
amino acids) 


relationship to sul 
(%, at amino acid 
level) 






total 


predicted 

transit 

peptide 


similarity 


identity 


C9 


2.6 


766 


none 


61 


45 


C15 


2.7 


793 


47 


82 


70 


C21 


2.9 


878 


38 


57 


35 



Example 2 - Transformation and antisense constructs 



The clones encoding the isoamylases are used to construct 
a series of lines of antisense potato plants. The clone 
{C9, CIS and C21) is subcloned in antisense orientation 
between the CaMV 35S promoter and the CaMV terminator 
sequences of pJITSO (see Guerineau & Mullinieaux (1993) in 
Plant Molecular Biology Lab Fax ed. Croy RRD BIOS 
Scientific, Oxford, UK pp 121-148) . This construct has 
been subcloned into the primary vector pBinl9 and 
transferred to Acrrobacterium tumifaciens (IaBA4404) by 
transformation and from there to potato tuber discs by 
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the method of Spychalla and Bevan (1993) Plant Tissue 
Culture Manual BII. 

Example 3 - Transformation and overexpression constructs 

5 

In other transformants, full length cDNA clones encoding 
the isoamylase type of debranching enzyme in potato are 
used to increase debranching enzyme activity levels in 
transgenic potatoes. This is achieved by cloning each of 
10 them between the 2 x CaMV 35S promoter and the CaMV 

Terminator of pJITSO. Thence into a binary vector such as 
pBinl9 between the T-DNA borders, in E. coli. It is then 
transferred to Aarobacterium tumifaciens for 
transformation into plants. 

15 

Example 4 - Enzyme purification 

Each different cDNA may be expressed in E. coli to define 
its activity closely and to obtain enough purified 
20 protein to produce an antiserum. This could.be done using 

any suitable system e.g. the pSTAG expression vector for 
E. coli (strain K38) (Moyano et al (1996) Plant Cell 8: 
1519-1532) : 

All three cDNA's were inserted into a vector which 
permitted expression in E.coli of proteins fused to a 15 
amino acid tag at the N-terminus (pET Expression system, 
Novagen) . The amount of expressed protein in E.coli 
extracts was then cjuantified by an assay for the S-tag 
and the proteins were then purified on an affinity matrix 
specific for the S-tag, The C21 and C15 cDNA' s were 
inserted into the vector after removal of the fragment 
encoding the putative transit peptide (pET system manual 
7th Edition, Novagen) . 

Fusion proteins from the C15 and C21 cDNA' s were 
successfully expressed to high levels in E.Coli as 



30 
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determined by using SDS-polyacrylamide gels (not shown) . 
In both cases, single bands of protein not present in 
E.coli transformed with the vector alone are seen in 
crude extracts of both the soluble and unsoliable 
5 (inclusion body) fractions of the bacteria. These bands 

correspond closely in size to the predicted size for the 
expressed proteins: 87}cD for C15 and 97kD for C21. 

The recombinant isoamylase is most readily assayed when 
10 it is either purified from other hydrolases or by use of 

specific inhibitors to negate the contribution of 
interfering enzymes. It may also be visualised on non- 
denaturing glucan-containing polyacrylamide gels on which 
activities of starch hydolysing enzymes are separated and 
15 then revealed by staining of hydrolysis products with 

iodine (see Kakefuda et al, 1996 Planta 168: 175-182). 

Example 5 - Preparation of antisera 

20 Polyclonal soitibodies against C15 and C21 were produced 
in New Zealand white rabbits using standard immunisation 
procedures (Harlow E. & Land D. (1988) Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbour, N.Y) . 

25 

. The immunoblot analysis was performed according to 
stctndard procedures (Sambrook J et al (1989) Molecular 
Cloning: A Laboratory Manual, Cold Spring harbor 
Laboratory Press, Cold Spring Harbor, NY) . Filters were 
30 incubated with the rabbit antiserum and immunoreactive 

bands were detected using the methods of Towbin H. Et al 
(1997) Proc. Natl. Acad. Sci. USA 76: 4350-4354. 

The presence of high-titer antibodies in antisera that 
35 recognised the proteins was demonstrated by immunoblot 

analysis of extracts from E.coli expressing either CIS or 
C21. Both CIS and C21 antisera immunoreact against CIS 
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and C21. 

Literature cited 
5 Ball et al (1996) Cell 86, 349-352. 

James et al (1995) Plant Cell 7, 417-429, 
Mouille et al (1996) Plant Cell 8, 1353-1356. 

10 

Nakamura et al (1996a) Physiol. Plant 97, 491-498. 
Nakamura et al (1996b) Planta 199, 209-218. 
15 Pan and Nelson (1984) Plant Physiol. 74, 324-328. 
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CLAIMS 

1. An isolated nucleic acid which comprises a 
nucleotide sequence which: 

5 (a) encodes a polypeptide which has the properties of an 

isoamylase, and 

(b) is obtainable from Solanum tuberosum 

2 . A nucleic acid as claimed in claim 1 wherein the 
10 polypeptide has an amino acid sequence shown in any one 

of Seq ID Nos 4, 5 or 6 . 

3 . A nucleic acid as claimed in claim 1 or 2 comprising 
a nucleotide sequence encoding an isoamylase, the 

15 sequence: 

(a) consisting of any one of Seq ID No 1, 2 or 3, or 

(b) being degeneratively equivalent to any one of Seq ID 
Nos 1 , 2 or 3 . 

20 4. A nucleic acid comprising a nucleotide secjuence 

which : 

(a) is a homologous variant of Seq ID No 1, 2 or 3, 
sharing at least 85, 90, 95, 96, 97, 98, or 99% sequence 
identity with any of said sequences, and 
25 (b) encodes a polypeptide having isoamylase activity. 

5. A nucleic acid as claimed in claim 5 wherein the 
varicuat is an allelic variant of Seq ID No 1, 2 or 3 . 

30 6. A nucleic acid as claimed in claim 4 wherein the 

variant is a derivative of Seq ID No 1, 2 or 3 by way of 
one or more of addition, insertion, deletion or 
substitution of one or more nucleotides. 

35 7. A nucleic acid as claimed in claim 6 wherein the 
variant does not encode a transit peptide sequence. 
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8 . A nucleic acid which is complementary to the nucleic 
acid of any one of claims 1 to 7 . 

9. A method for identifying or cloning an isoamylase 
from a plant species, which method employs a nucleic acid 
molecule having a nucleotide sequence of at least 15 
nucleotides, which sequence is shown in, or complementary 
to, all or part of Seq ID No 1, 2 or 3 . ^ 

10 . A method as claimed in claim 9 comprising the steps 
of: (a) providing a preparation of nucleic acid from a 
plant cell, 

(b) providing a nucleic acid molecule having a nucleotide 
sequence of at least 15 nucleotides, which sequence is 
shown in, or complementary to, all or part of Seq ID No 

1 , 2 or 3 , 

(c) contacting nucleic acid in said preparation with said 
nucleic acid molecule under conditions for hybridisation 
of said nucleic acid molecule to any nucleic acid 
encoding an isoamylase in said preparation, 

(d) identifying said nucleic acid encoding an isoamylase. 
if present by its hybridisation with isaid nucleic acid 
molecule, and optionally 

(e) confirming the identity the isoamylase encoded by the 
nucleic acid by expressing it and assessing its activity. 

11. A method as claimed in claim 10 wherein the 
hybridisation conditions are selected such to allow the 
identification of sequences having about 85% or more 
sequence identity with the nucleic acid molecule. 

12 . A method as claimed in claim 11 comprising use of 
two primers to amplify a nucleic acid encoding an 
isoamylase, at least one of the primers having a 
nucleotide sequence of at least 15 nucleotides, which 
sequence is shown in, or complementary to, all or part of 
Seq ID No 1, 2 or 3. 
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13. A method as claimed in claim 12 comprising the steps 

of: 

(a) providing a preparation of nucleic acid from a plant 
cell, 

5 (b) providing a pair of nucleic acid molecule primers 

suitable for PGR, at least one of the primers having 
a nucleotide sequence of at least 15 nucleotides, which 
sequence is shown in, or complementary to, all or part of 
Seq ID No 1, 2 or 3, 
10 (c) contacting nucleic acid in said preparation with said 

primers under conditions for performance of PGR, 

(d) performing PGR and determining the presence or 
absence of an amplified PGR product, and optionally 

(e) confirming the identity of the amplified PGR product 
15 by expressing it and assesssing its isoamylase activity. 

14 . A nucleic acid molecule for use as a probe or primer 
in the method of any one of claims 10 to 13, said 
molecule having a nucleotide sequence of at least 15, 18, 

20 21, 24 or 30 nucleotides, which sequence is shown in, or 

complementary to, all or part of Seq ID No 1, 2 or 3 . 

15 . A recombinant vector comprising the nucleic acid of 
any one of claims 1 to 8 . 

25 

16 . A vector as claimed in claim 15 which is capable of 
replicating in a suitable host . 

17 . A vector as claimed in claim 15 or claim 16 wherein 
30 the nucleic acid is opercQDly linked to a promoter or 

other regulatory element for transcription in a host 
cell. 

18 . A vector as claimed in claim 17 further comprising 
35 any one or more of the following: a terminator sequence; 

a polyadenylation sequence; an enhancer sequence; a 
marker gene. 
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19. A vector as claimed in claim 17 or claim 18 wherein 
the promoter is an inducible promoter. 

20. A vector as claimed in any one of claims 15 to 19 
5 which is a plant vector. 

21. A vector as claimed in claim 20 comprising a 
selectable genetic marker which confers a selectable 
phenotype selected from: resistance to antibiotics or 

10 herbicides. 

22. A method comprising the step of introducing a vector 
as claimed in any one of claims 15 to 21 into a cell. . 

15 23. A method for transforming a plant cell, comprising a 

method as claimed in claim 22, and further comprising the 
step of causing or allowing recombination between the 
vector and the plant cell genome to introduce the nucleic 
acid into the genome. 

20 

24. A host cell comprising a vector as claimed in any 
one of claims 15 to 21. 

25. A host cell transformed with a vector as claimed in 
25 any one of claims 15 to 21. 

26. A host cell as claimed in claim 24 or claim 25 which 
is a plant cell . 

30 27. A host cell as claimed in claim 26 which is derived 

from a starch- synthesis ing organ. 

28. A host cell as claimed in claim 27 which is in a 
tuber. 

35 

29. A host cell as claimed in any one of claims 26 to 28 
which is in a plant. 
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30. A method for producing a transgenic plant comprising 
a method as claimed in claim 23 and further comprising 
the step of regenerating a plant from the transformed 
cell. 

5 

31. A plant comprising the cell of any one of claims 26 
to 29. 

32. A plant as claimed in claim 31 produced by the 
10 method of claim 30. 

33. A plant which is the progeny of a plant as claimed 
in claim 31 or claim 32. 

15 34. A plant as claimed in any one of claims 31 to 33 

which is selected from: potato; pea; modified maize , 
wheat, cassava, rice and barley. 

35. A part or propagule of the plant of any one of 
20 claims 31 to 34. 

36. A polypeptide encoded by the nucleic acid of any one 
of claims 1 to 7. 

25 ' 37. A method of producing a polypeptide comprising 

the step of causing or allowing the expression from a 
nucleic acid of any one of claims 1 to 7 in a suitable 
host cell. 

30 38. A composition comprising the polypeptide of claim 

36. 

39. An antibody or fragment thereof, or a polypeptide 
comprising the antigen-binding domain of the antibody, 
35 capable of specifically binding the polypeptide of claim 
36. 
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40. A method of producing the antibody or fragment as 
claimed in claim 3 9 comprising the step of immunising a 
mammal with a polypeptide of claim 36. 

5 41. A method of identifying and/or isolating an 

isoamylase comprising the step of screening candidate 
polypeptides with a polypeptide comprising the antigen- 
binding domain of the antibody of claim 39. 

10 42. A method for the synthesis of a brauiched 

polysaccharide comprising the use of the polypeptide of 
claim 36. 

43. A method for altering the quality or quantity of a 
15 polysaccharide in a host cell by influencing the 

isoamylase activity in that cell, the method comprising 
use of any one or more of the following: all or part of 
the nucleic acid of any one of claims 1 to 8; the 
polypeptide of claim 36; the antibody or fragment or 
20 polypeptide comprising the antigen-binding site thereof 
of claim 39. 

44.. A method as claimed in claim 43 wherein the 
subcellular location of the isoamylase activity is 
2 5 manipulated . 

45. A method as claimed in claim 43 wherein the activity 
of two or more isoamylases is manipulated. 

30 46. A method as claimed in any one of claims 42 to 45 

wherein the polysaccharide is amylopectin. 

47. A method as claimed in any one of claims 43 to 46 
wherein the quality altered is the branching of the 

35 amylopectin. 

48. A method as claimed in claim 47 wherein the 
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amylopectin is altered in at least one of the following 
ways : 

(a) the degree of branching is decreased, or 

(b) the degree of branching is increased, or 
5 (c) the branching pattern is changed. 

49. A method as claimed in any one of claims 43 to 48 
comprising the step of causing or allowing expression of 
a nucleic acid according to any one of claims 1 to 8 

10 within the cell. 

50. A method as claimed. in any one of claims 43 to 49 
comprising repressing the isoamylase activity in the 
cell. 

15 

51. A method as claimed in claim 50 comprising the step 
of causing or allowing the transcription of part of the 
nucleic acid of any one of claims 1 to 7 in the cell such 
as to co-suppress the expression of an isoamylase. 

20 

52. A method as claimed in claim 50 comprising the step 
of causing or allowing the transcription of nucleic acid 
of claim 8 in the cell. 

25 53 . . A method as claimed in claim 50 comprising the step 

of causing or allowing the expression of a polypeptide 
comprising the antigen -binding domain of the antibody of 
claim 39. 

30 54. A method as claimed in any one of claims 43 to 53 

wherein the cell is the plant cell of any one of claims 
2S to 29. 

55. A starch the quality of which has been altered in 

35 accordcuice with the method of any one of claims 43 to 54. 

56. A plant product derived from any one of the plants 
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of claims 31 to 34 or the plauit cells of claims 26 to 29, 
said product comprising a starch of claim 55. 

57. A commodity comprising the starch of claim 55. 

5 

58. A commodity as claimed in claim 57 which is selected 
from: a human or animal foodstuff; a biodegradable 
plastic; a food-processing thickener; a starch coated 
film; a starch coated paper; a starch coated textile; a 

10 paint thickener; a mining explosive; a pharmaceutical; a 
glue - 
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X CTTATGGGAC TTGATTAAGA A 



X CTTATGGGAC TTGATTAAGA ATATGTGATC CACCAAGTTC TATATCTGAC 
51 GCTGTCGTAA CATTGTGTGC TGCTAATGGC AACTTCACCA ATACAGTTGG 
101 CTGTGCATTC ACGTTTGTTG AGCTATGGCA GTACTGAGTC AACCAAGTTQ 
151 GTTCCTTCAT CATCAGGTAA CCGTGGAAAA ATAGTATGCA GTCTAAG6AA 
201 GCTGGAATTG GAAGACATGA ATTTCTCTGG CATAGGTCGA AATAATGATC 
251 AAGAAGCTCC TAGGAGAGCT CATCGACGAA AAGCACTATC AGCATCGAGA 
301 ATTTCGCTTG TTCCATCTGC AAAAAGGGTT CCCACTTACC TTTTCAG6AC 
351 AGATATTGGT GGTCAA6TGA AAGTCTTGGT GGAAAGGACA AATGGAAAGT 
401 ACAAAGTGCT TGTAGAAGTA TTGCCATTGG AGCTCTCATA TGCACATTCT 
451 GAGCTGGTTA TGGTTTGGGG TCTTTTTAGA TCTGATGCTT CATGCTTTAT 
501 GCCTCTAGAT CTAAATAGAC GTGGAGCAGA TGGAAAAAGT AGTACTGTTG 
551 AAACACCATT TGTGCAAGQA CCTTCAGGCA AGGTCACCGT GGAGCTGGAT 
601 TTTGAAGCAA GTTTAGCCCC CTTCTATATC TCCTTCTATA TGAAGTCGCA 
651 ACTAGTTTCT GACATGGAAA ACTCAGAAAT CAGAAGTCAC AGGAACACAA 
701 ATrTTGTTGT ACCAGTTGGT CTCAGTTCA6 GGCATCCTGC TCCATTGGGT 
751 ATTTCCTTTC AGCCAGATGG ATCTGTGAAT TTTGCTCTCT TCTCACGCAG 
801 TGCAAGAAGT GTAGTTCTGT GCTTGTATGA TGACATATCA GTTGAAAAAC 
851 CTTCTTTA6A GATTGATCTA GATCCTTATA TTAATCGATC AGGCGATATT 
901 TGGCATGCTG CTTTAGATTG TTCTTTGCCA TTTAAGACTT ATGGTTATAG 
951 ATGTAAGGCG ACTACTTCTG GGAAGGGAGA GCTGGTTCTT TTGGACCCAT 
1001 ATGCTAAGGT GATAA6GC6T GTTATTCCTC GTCAGGGTGG GTCTGAGATA 
1051 CGTCCAAAAT ATCTTGGAGA ACTATGCCTG GAACCTGGCT ATGATTGGAG 
1101 CGGTGATGTC CCCCCTAGCT TACCTATQGA GAAACTAATA ATTTACCGCT 
1151 TAAATGTGAC TCAATTTACA AAG6ACAAGT CCAGTAAGCT ACCTGATGAC 
1201 CTTGCTGGAA CTTTCTCTGG CATTAGCGAA AAATGGCACC ATTTTAAAGA 
1251 TCTTGGTGTG AATGCAATGT TACTGGAGCC AATTTTCCCT TTTGATGAGC 
1301 AGAAAGGACC CTATTTTCCG TGGCATTTCT TCTCACCTGG AAATATGTAT 
1351 GGACCrrCTG GTGACCCTCT TTCTGCCATT AAATCGATGA AGGATATCGT 
1401 TAAGAAATTA CATGCTAACG GGATA6AGGT TTTTCTTGAA GTTGTTTTCA 
1451 CTCACACTGC AGAGGATGCA CCTTTGATGA ATGTTGATAA CTTTTCATAT 
1501 TGCATAAAAG CTGGTCA6TA TCTGAATATT CAAAATGCAT TGAATTGCAA 
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1551 TTACCCCATA GTCCAACAAA TGATTTTGGA CTGTCTCCGC CACTGGGTAA 

1601 TTGAGTTTCA TATTGATGGT TITCTrm ' G TCAACGCTTC 1 JUJCTIU r m 

1651 AGAGGGTTCA ATGGAGAGAT TCTATCTCGT CCTCCATTAG TTGAAGCTAT 

1701 TGCCTTTGAT CCTATCCTTT CAAAGGTCAA GATGATTGCA GATAATTGGA 

1751 ATCCATTAAC CAATGATTCG AAGGAAAATT TATTCCCTCA CTGGAGGAGA 

1801 TGGGCAGAGA TAAATATGAG ATTTTGTGAT GACATTCGAG ACTTCTTGAC5 

1851 AGGCGAGGGT CTTCTAAGCA ATCTAGCAAC ACQACTTTGT GGAAGTGGGG 

1901 ATATCTTCGC AGGT6GACGT GGTCCTGCAT TCTCTTTTAA TTATATTGCC 

1951 AGAAATTCTG GACTCACACT TGTTGACCTA GTTAGCTTCA GTAGTAATGA 

2001 AGTGGCTTCA GAGTTAAGTT GGAACTGTGG ACAAGAAGGC GCTACGACCA 

2051 ATAACATTGT CCTAGAGAGA CGACTTAAAC AAGTTCGTAA TTTTCTGTTC 

2101 ATATTGTTCA TTTCTCTAGG TGTACCAGTA CTTAACATGG GAGACGAGTG 

2151 TGGTCAGTCT TCAGSGAGGTC CCCCTGCATA TGATGCTCGA AAATCTTTGG 

2201 6TTGGAATAC TTTAAAAACT GGTTTTGGGA CTCAGATTGC CCAGTTTATT 

2251 TCATTCTTGA GTAATTTAAG AATGAGAAGA AGTGATCTTC TTCAAAAGAG 

2301 AACCTTCTTG AAGGAAGAAA ACATCCAGTG GCATGGGAGT GACCAATCTC 

2351 CTCCGAAATQ 6GATGGCCCG TCTAGCAAAT TCTTGGCTAT GACTTTGAAG 

2401 GCCX^ATGGTG AAGTCAGCCA GACATTAGTC TCTGATATCG TAGGTGACCT 

2451 GTTTGTTOCT TTCAATGGTG CTGGTGATTC AGAGATTGTT ATCCTTCCAC 

2501 CTCCTCCAAC AGATATGOTA TGGCATCGTC TCGTTGACAC AGCCCTCCCT 

2551 TTCCCGGGGT TTTTCGATGA GAAGGGAACT CCA6TTGAA6 ATGAATTA6T 

2601 TGCTTATGAG ATGAAGTCTC ACAGCTGTTT GCrGTTTGAA 6CTCAGAGAC 

2651 TAGCTGAAAT AGATTCTA6C AAGA6AAAGA AACAGATTAG ACTTTCTTCT 

2701 AAGAGGCAAT AGrTTCTAAA GCCCCTAAGT ATATATATAT GTTTAAATAA 

2751 GAGGCTTTTT TTTCTGAATA AATAAGAAGA TTTTACTGAG AATACTTGTA 

2801 TCTAAACATT TTCTTTTGCA GCTTCAAATA AAAAAAAAAA AAA 
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-^-/rl ^ CTCAGTCCTT CTCAATTTCA GTGCCACATA CTCTAGATCA CACTCTCTCT 
51 TCTTCCTCAA AGTTCTCCCA TGGAGTTACT TCATTGTCCT TCCATTTCTA 
101 CCTACAAACC TAAACTCTCT TTCCACAACC ATCTTTTCTC GAGCyUSAAGC 
151 AGTAACGGTG TAGATTTTGA GAGTATTTGG AGAAAATCGA GGTCTTCAGT 
201 (3GTTAATGCT GCTGTTGATA GTGQACGTGG AGGTGTGGTG AAGACTGCGG 
251 CTACTGCGGT GGTGGTGGAG AAGCCGACGA CXX3AACGATG TCGTTTTCAG 
301 Girri'ATCAG GGAAGCCATT GCCGTTrGGT GCTACTGCQA CAGATGGTCG 
351 TGTGAATTTC GCTGTTmT CAAGGAATGC TACAGCTGCT ACTCTTTGCT 
401 TGATCACTCT TTCCGATTTA CCTGAGAAGA GAGTTGACCGA GCAAATTTTC 
451 CTGGATCCTC TAGCTAATAA AACTGGAGAT GTATGGCATG TGTTCCTTAA 
501 GGGAGATTTT GAGAATATGC TATATGGCTA CAAATTTGAT GGGAAATTCT 
-551 GTCCTGAAGA AG6ACACTAC TTTGACTCTT CGCAGATAGT GTTGGATCCT 
601 TATGCCAAG6 CTATAGTAAG CAGAGGAGAA TATGGTGTAT TAGGGCCAGA 
651 GGATGATTGT T6GCCCCCAA TGGCTGGCAT GGTACCTTCT GCTTCTGATC 
701 AGTTTGATTG GGAAGGAGAT CTACCACTGA AGTTTCCACA GAGAGATCTT 
751 GTAATCTATG AAATGCATGT TCGTGGGTTT ACTAATCATG AGTCGAGTGA 
801 AACAAAATAT CCTGGTACTT ACCTTGGTGT TGTGGAGAAA CTTGATCACT 
851 TiGAAGGAACT TGGTGTCAAC TGTATA6AGC TAATGCCCTG TCACGAGTTC 
901 AATGAGCTGG AGTACTATAG TTATAACTCT GTATTGGGCG ACTACAAGTT 
951 TAACTTTTGG GGCTATTCTA CTGTCAATTT CTTTTCTCCA ATGGGAAQAT 
1001 ACTCATCTGC TGGTCTAAGT AATTGCX5GCC TCGGT6CAAT AAACGAATTT 
1051 AAOTATCTTG TCAAGGAAGC ACATAAACGT GGAATCGAGG TTATCATGGA 
1101 TOTTOTTTTC AATCACACTG CTGAAGGAAA TGAAAATG6T CCCATACTAT 
1151 CArrTAGAGG CATTGAC3UW: AGTGrGTTTT ATACGCTAGC TCCTAAG6GT 
1201 GAATTTTAaV ACrACTOVGG ATGTGGAAAT ACCTTCAACT GTAATAATCC 
1251 GATTGTACGT CAATTTATAG TOGATrGCTT GAGATATTGG GTTACCGAAA 
1301 TGCACOTAgA TGGCTTCCGC TTTGATCTTG CTTCTATCCT TACAAGAAGT 
1351 AGCAGCTCGT GGAATGCTGT AAATGTCTAT GGAAATTCAA TTGACGGTGA 
1401 CGTGATCaCC ACAGGCACTC CTCTCACAAG CCCACCATTG ATTGATATGA 
1451 TTAGCAATOA TCCAATACTT CGTGGAGTAA AGCTTATAGC TGAAGCATGG 
1501 GATTGTGGAG GCCTTTACCA AGTTGGCATQ TTTCCGCACT GGGGTATCTG 
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1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 
2601 
2651 
2701 



GTC6GAGTGG AACGGAAAGT ACCGTGACAT 6GTACGGCAG TTCATCAAAG 
GCACTGATGG GTTTTCTGGG GCTTTTGCTG AATGCCTTTG TGGAAGCCCA 
AATCTATACC AGAAAGGAGG AAGAAAACCA TGGAACAGTA TAAATTTCGT 
GTGTGCCCAC GATGGTTTTA CTTTGGCTGA TTTAGTGACA TACAACAATA 
AACACAATTT GGCAAATGGA GAGGACAACA AAGACGGGGA GAATCACAAT 
AATAGTTGGA ATTGTGGTGA GGAAGGAGAA TTTGCAAGTA TCrTTGTGAA 
GAAATTGAGG AAAAGACAAA TGCGGAACTT CTTCcTCTGC cTTAtGGTTT 
CCCAAGGTGT TCCCATGATA TATATGGGCG ATGAATAT6G TCACACTAAG 
GGAGGAAACa ACAACACGtA TTGCCATGAT AATTATATTA ATTACTTCCG 
TTGGGATAAG AAGGATGAAT CTTCATCTGA TTTTTTGAGA TTTTGCGGCC 
TCATGACCAA ATTCCGCCAT GAATGTGAAT CACTGGGATT AGATGGTTTC 
CCTACAGCAG AAAGGCTGCA ATGGCATGGT CACACTCCTA 6AACTCCAGA 
TTGGTCTGAA ACAAGTC6AT TCGTTGCATT CACACTGGTC GACAAAGTGA 
AGGGAGAACT ATATATTGCC TTTAACGCCA GCCATTTGCC TGTAACGATT 
ACACTTCCAg ATAGGCCTGG TTATAGATGG CAGCCGTTTG TGGACACAGG 
CAAACCAGCA CCATTTGACT TCTTGACAGA CGACGTTCCT GAGAGAGAGA 
CAGCAGCCAA ACAATATTCT CATTTTCTGG ACGCGAACCA GTATCCGATG 
CTCAOTTATT CATCCATTAT TCTTTTACTA TCATCTGCTG ATGATGCATA 
GTTTCATTCA CCAAGTTAGG TGGAGGTAAA TCAGCTTCAG ATTTTGTTAT 
ATGCAGTGA6 GT G TT A CTTT GTAAATAAAA GTAAGAAGCA GGACAGAACA 
GAACTGCAAA CGGATAAAAT TTGTGAGGAA GAAGCT6ATG ATTTATAAGA 
tACACCTTGT aTTtTAATtG CATTTATATA AAATAAAATA nTAGTGAAAT 
TGTcTGTQcG AAAaaaaaaa AAAAAAAAAA TAAAAAAAAA AAAAAAAAAA 
AAAAAA . 
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/O'^ ^ CACAGATTCT CTTCTCCAAA AATAGGGCCC GATGATTAGA GGACCACCAC 

51 AAATCGTCCA GAAATGTCCA ACCGACATTG TAACAGTTAA CCGGACCAAT 

101 ATAGTTCCAC GAACGCACCG TCACGCTCTT CAAGATCTCC GGCAGCTTCG 

151 CCGGCOTGAT AGTCTCAGGC TCTTCTCCTC TGATCACCGG ATTCTGAAGT 

201 TTTGTACATC GGAGGAGGCG TTCCAACCTA GGllXJGI X -' G C AGCAGCTAAA 

251 CTTCAGOAAG AAGCTCCTCA AATGCTGGAC ACTTTCCCTT CATTCAAAGT 

301 TTCCCCTGQT CTGGCTCATC CACTAGGAGT ATCA6AAACT GAAAGTGGAA 

351 TAAATTTTGC AATTTTTTCT CAGCATGCTT CTGCAGTTAC ACTTTGCATA 

401 ATTCTTCCAA AGAGTGTTCA TGATGGAATG ATTGAATTAG CATTCGATCC - 

451 ACAGAAGAAC CX3CACAGGAG ACATATGGCA CATATGCATT AAGGAGTTGC 

501 CCCAAGGTGG TtnXTCTTTAT GGTTATCGCA TTGATGGACC TCGAAATTQG 

551 CATGAAG<3GC ATCGATTTGA TGATAGCATT ATTTTGGTTG ATCCTTACGC 

601 AAAACTAATT GAAGGTCGAC C3AGTTTTTGG AGATGAAAGC AATAAAATGT 

651 GTAGATTTTT TGOAACTTAT GATTTCAATA GCTTGCCTTT TGACTGGGQA 

701 GAAAATTACA AGCTTCCAAA TATACCCGAG AAAGATCTTG ttatatatga 

751 gatgaatgtt cgtoctttta ctgctgatga aacaagtagt ttggatcaaq 
801 atcaacgggg aagttacctt ggcttaattg aaaagatacc acatcttctc 

B51 GAGCTTGGTG TCAATGCAGT AGAATTATTG C Cl ' U ' iTrri ' G A Gmij ATGA 

901 ACTGGAATTA CAAAGGCGAC ctaatccgag agatcacatg atcaatacat 

951 ggggctactc aacaataaac T ' r r mxjci ' c caatgagtcg atatgcaagt 

1001 tgtggtggcg gacctgtccg tgcttcctgg gagttcaaag aaatggtcaa 

1051 ggccttgcat ggtoctggaa ttgaggtcat cttagatgtt gtttataatc 

1101 acacaaatga agctgatgat gaaaacccat atacaacctc attccgasga 

1151 ATAGACAACA AGGTTTATTA CATGGXAGftT TTAAAC3VACA ATGCTCA6CT 

1201 GCTGAATTTC GCTGGATGTG GAAATACTTT TAACTGCAAC CATCCcACAG 

1251 TCATGOAACT TATACTTGAA AGCTTAAGAC ACTGGGTCAC CGAGTATCAT 

1301 GTCGATGGAT TTCGCTTTGA TCTTGCTAGT GTTCTTTGCA GAGGGACAGA 

1351 TGGTACTCCC ATTAATGCTC CCCCCCTTGT TAAGGCCATT TCCAAAGATA 

1401 GTGTATTGTC GAGGTGCAAA ATTATTGCTG AGCCATGGGA TTGTGGAGGC 

1451 CTATATCTTG TTGGAAAOTT TCCGAACTGG GACCGGTGGO CTQAOTGGAA 

1501 TGGGAA6TAC CGCGATGACA TCAGGAGATT TATAAA666C 6ATGCTOOCA 
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1551 TGAAAGGAAA TTTTGCAACC CGTATCGCAG GTTCAGCGGA TCTGTACAGA 
1601 GTGAACAAGC GAAAGCCGTA CCACAGTGTC AACTTCGTCA TTGCCCATGA 
1651 TGGCTTTACC TTGTATGACC TTGTTTCATA CAATAATAAG CACAATGATG 
1701 CAAACGGTGA AGGTGGCAAT GATGGATGCA ATGACAACTT CAGTTGGAAT 
1751 TGTGGAATTG AAGGTGAAAC TTCAGATGCA AATATTAACG CACTGCGTTC 
1801 ACGGCAAATG AAAAATTTTC ATTTGGCACT GATGGTTTCT CA6GGAACAC 
1851 CAATGATGCT TATGGGGGAT GAGTATGGGC ATACCCGCTA TGGAAATAAT 
1901 AACAGTTATG GA;CATGATAC CGCCATCAAC AATTTCCAGT GGGGACAATT 
1951 GGAAGCAAGG AAGAATGATC ACTTCAGGTT CTTTTCCAAG ATGATAAAGT 
2001 TTCGACTGTC CCACAATGTT CTTAGAAAGG AAAACTTCAT TGAGAAGAAC 
2051 GACATTACCT GGCTCGAGGA CAACTGGTAC AATGAAGAGA GTAGATTCCT 
2101 TGCATTTATG CTCCATGATG GGAATGGAGG AGATATTTAC TTGGCATTTA 
2151 ATGCACACCA CTTCTCCATC AAAACAGCAA TACCTTCACC ACCACGAAAT 
2201 AGAAGTTGGT ACCGAGTGGT GGACACTAAT CTGAAATCAC CAGATGATTT 
2251 TGTTACTGAG GGAGTGTCTG GTATCAGTAA AACTTATGAT GTTGCGCCGT 
2301 ACTCTGCTAT CCTTCTTGAA GCAAAGCAAT AATTACCGGG ACTATGCTGC 
2351 TTTAGATGTT GTCCATGAGT TATTACAGTA TTACCTCCTT CTGGATTGGA 
2401 TAGTTCAAAT CGGAATTCAG GCTGTTAGCC TATAGATGTT TGCAATAAGC 
2451 AACCAGTTTG TTCAAGCT6C TATTGACAGG TACAAACACC CCATA6TAAT 
2501 AAGATAAACT GAGACCATTG ATCCAAAAAA AAAAAAAAAA AAAAAAAAAA 
2551 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
2601 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 
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1 MATSPIQIAV HSRLLSYGST ESTKLVPSSS GNRGKIVCSL RKLELEDMMF 

51 S6IGRNMDQB APRRAHRRKA LSASRISLVP SAKRVPTYLF RTDIGGQVKV 

101 LVERTNGKYK VLVEVLPLEL SYAHSELVMV WGLFRSDASC FMPLDLNRRG 

151 AZX3RSSTVET PFVQGPSGKV TVELDFEA5L APFYISFYMK SQLVSDMENS 

201 EXRSRRNTNF WPVGLSSGH PAPLGISFQP DGSVNFALFS RSARSWLCL 

251 YDDISVEKPS LEIDLDPYIN RSGDIWHAAI, DCSLPFKTYG YRCKATTSGK 

301 GELVLLDPYA KVIRRVIPRQ GGSEIRPKYL GELCLEPGYD WSGDVPPSLP 

351 MBKLUYRIiK VTQFTKDKSS KLPDDLAGTF SGXSEKWHHF iCDLGVNAMLL 

401 EPIFPFDEQK GFYFPWHFFS P6NMYGPSGD PLSAIKSMKD MVKKLHANGI 

451 EVFLBWFTH TAEDAPIMYV DNFSYCIKGG QYLNIQNALN CNYPIVQQMI 

501 LDCLRHWVIE FHIDGFVFVN ASSLLRGFNG EILSRPPLVB AIAFDPILSK 

551 VKMIADNWNP LTNDSKENLF PHWRRWAEIN MRFCDDIRDF LRGEGLLSNI* 

601 ATRLCGSGDI FA66RGPAFS FNYIARNSGL TLVDLVSFSS KEVASELSWN 

651 C6QEGATTNM IVLERRIJCQV RNFLFILFIS LGVPVLMMGD ECGQSSGGPP 

701 ATOARKSLGH MTLKTGFGTQ lAQFISFLSN LRMRR5DLLQ KRTFLKBENI 

751 QHBGSDQSPP KWD6PSSKFL AMTLKADAEV SQTLVSDIVG DLFVAFKGAG 

801 DSBIVIIiPPP PTDMVWHRLV DTALPFPGFF DEKGTFVEDE LVAYBMKSBS 

851 CLXiFBAQRIiA. BIDSSKRRKQ IRLSSKRQ 
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1 



MELLHCPSIS TYKPKLSFHN HLFSRRSSNG 



VDFESIWRKS RSSWNAAVD 



51 SGRGGWKTA ATAWVBKPT TERCRFBVLS GKPLPFGATA TDGGVNFAVF 

101 SRNATAATIiC LITLSDLPEK RVTEQIFLDP LANKTGDVWH VFLKGDFENM 

151 LYOyKFDGKF CPEEGHYFDS SQIVLDPYAK AIVSROEYGV LGPEDDCWPP 

201 MA6MVPSASD QFDYIE6DLPL KFPQRDLVIY EMHVRGFTNH ESSETKYPGT 

251 YLGWEKLDH LKELGVNCIE LMPCHEFNEL BYYSYNSVLG DYKFNFWGYS 

301 TVNFFSPMGR YSSAGLSNCXS LGAINEFKYL VKEAHKRGIE VIMDWFNHT 

351 AEGNENGPIL SFRGIDNSVF YTLAPKGEFY NYSGCGNTFN CNNPIVRQFI 

401 VDCLRYWVTB MHVDGFRFDL ASILTRSSSS WNAVNVYGNS IDGDVITTGT 

451 PLTSPPLIDM ISNDPILRGV iCLIAEAWDCG GLYQVGMFPH WGIWSEWNGK 

501 YRDMVRQFIK GTDGFSGAFA BCLCGSPNLY QKGGRKPWNS INFVCAHDGF 

551 TLADLVTYNN KHNLANGEDN KDGENHMNSW NCGEEGEFAS IFVKKLRKRQ 

601 MRNFFLCLMV SQGVPMIYMG DEYGHTKGGM NNTYCHDNYI NYFRWDKKDE 

651 SSSDFIiRFCG UfTKFRHBCE SLGLDGFPTA ERLQHHGHTP RTPDWSBTSR 

701 FVAFTLVDKV KGSLYIAFNA SHLPVTITLP DRPGYRWQPF VDTGKPAPFD 

751 FLTDDVPERE TAAKQYSHFL DANQYPMLSY SSIILLLSSA DDA 
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1 


MIRGPPQIVQ 


KCPTDIVTVN RTNIVPRTHR HALQDLROLR RRDSLRLFSS 


51 


DHRIUCFCTS 


EEAFOPRLVA AAKLQEBAPQ MLDTFPSFKV SPGLAHPLGV 


lOX 


5ETESGINFA 


XFSQHASAVT 


LCIILPKSVH DOIIELALDP QKNRTGDIWH 


151 


ICIKELPQGG VLYGYRIDGP 


RNWHEGHRFD DSIILVDPYA KLIEGRHVFG 


201 


DESNKMCRFF 


GTYDFNSLPF 


DWGENYKLPN IPBKDLVIYB MNVRAFTADB 


251 


TSSLDQDQRG 


SYLGLIEKIP 


HLLELGVNAV EiiLPVFEFDE LELQRRPKPR 


301 


DHMINTWGYS 


TINFFAPMSR 


YASCGGGPVR ASWEFKEMVK ALH6AGIEVI 


351 


LDWYMHTNE 


ADDENPYTTS 


FRGIDNKVYY MVDLNNNAQL IjNFAGCGNTF 


401 


NCNBPTVNEL 


ILESLRHWVT 


BYHVDGFRFD LASVLCRGTD GTPINAPPLV 


451 


KAISKDSVLS 


RCKZIAEPND 


C6GLYLVGKF PNWDRWAEWN GKYRDDIRRF 


501 


IKGDAGMKGM 


FATRIAGSAD 


LYRVMKRKPY HSVMFVIAHD GFTLYDLVSY 


551 


NNKRNDANGE 


GGNDGCNDNF 


SWNCGIEGET SDAMIHALRS RQMKNFHLAL 


601 


hfVSQGTPMNL MGDEY6HTRY GNNNSYGHDT AIKNFX)WGQL EARKNDHFRF 


651 


FSKMIKFRIiS 


HNVLRRENFI 


EKNDITWLED NWYNEESRFL AFMLHDGNGG 


701 


DIYLAFNAHH 


FSIKTAIPSP 


PRNRSWYRW DTNLKSPDDF VTEGVSGISK 


751 


TYDVAPYSAI LLEAKQ 
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1 AGTAGTTTTT ACACATACTG CTGATTCTGG AGCTCTTCGT GGAATTGATG 

51 ACAGTTCCTA TTACTACAAG GGAAGAGCCA ATNATCTAGA TTCTAAAAGT 

101 TACTTGAACT GTAACTATCC TGTTGTTeAG CAGTTGGTAT TGGAGAGCTT 

151 GCGTTATTGG GTAACCGAGX TTCATGTAGA TGGATTTTNT TTTATAAATN 

201 CTTCATCTCT CTTGAGAGGC GTTCACGGTG AACAGCTCTC TCGTCCTCCT 

251 TTGGTTGAAG CAATAGCTTT TNATCCACTT CTTGCGGAGA CCAAACTAAT 

301 AGCTGATTGC TGGGNTCCAC TTGAAATGNT GCCANAAGAA GTACGGGTTC 

351 CCACAATTTG GAAGCNATNG GCAGAACTCA NNNCAAGGTN TTTTTCGAAA 

401 TNTNAGGAAA TTTTTTAAGG GGAA^GGG 




X MATSPIQLAV HSRI*LSYGST ESTKLVPSSS GNRGKIVC 




1 MELLHCPSIS TVKPKLSFHN ffLFSRRSSNG VDFESIWRKS asSWNA 




8B 
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pula__klepn 

• pula_kleae 

' sopulspo 

■ — ■ puli 

sulzmay 

I c15-p 

' c9-p 

— C21-P 

— isopseam 

Fig, 9 
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• 1 CTTATGGGAC TTGATTAAGA ATATGTGATC CACCAAGTTC TATATCITSAC 

51 GCTGTTCTAA CATTGTGTGC TGCTAATGGC AACTTCACCA ATACAGTTGG 

101 CTGTGCATTC ACGTTTGTTG A6CTATGGCA GTACTGAGTC AACCAAGTTG 

151 GTTCCTTCAT CATCAGGTAA CCGTGGAAAA ATAGTATGCA GTCTAAGGAA 

201 GCTGGAATTG GAAGACATGA ATTTCTCTGG CATAGGTCGA AATAATGATC 

251 AAGAAGCTCC TAGGACSAGCT CATCGACXSAA AAQCACTATC AGCATC6ASA 

301 ATTTCGCTTG TTCCATCTGC AAAAAGGGTT CCCACTTACX: TTTTCAGGAC 

351 AGATATT66T 66TCAAGTGA AAGTCTTGGT GGAAAAGACA AATGGAAAGT 

401 ACAAA6TGCT TGTAGAAgTC TTGCCATTGG AGCTCTCAGA TGCACATTCT 

451 GAGCTAGTTA TGGTTTGGGG TCTTTTTAGA TCTGATGCTT TATGCTTTAT 

' 501 GCCTCTGGAT CTAAACAGAC GTGGAGCAGA TGGAAAAAGT AGTACTGTTG 

551 AAACACCATT TGTGCAAGGA CCTTCAGGCA AGGTCACCGT GGAOCTGGAT 

601 TTTGAAGCAA GTTTAGCCCC CTTCTATATC TCCTTCTATA TGAAGTCACA 

S51 ACTAGTTTCr GACATGGAAA ACTCAQAAAT CAGAAOTCAC A6GAACACAA 

701 ATTTTGTTGT ACCAGTTGGT CTCAGTTCAG GGCATCCTGC TCC31TTGGGT 

751 ATTTCCTTTC AGCCAGATGG ATCTGTGAAT ii mC T C l Vr TCTCACGCAG 

801 TGCAAGAAGT GTAGTTCTGT GCTTGTAT6A TGACATATCA GTTGAAAAAC 

S51 CTTCTTTAGA GATTGATCTA GATCCTTATA TTRATCGATC AGGCGATATT 

901 TGGCAT6CTG CTTTAGATTG TTCTTTGCCA TTTAAGACTT ATGGTTATAG 

951 ATOTAAGGCG ACTACTTCTG GGAAGGGAGA GCTGGTTCTT TTGGACCCAT 

1001 ATGCTAA66T GATAAGGCGT GTTATTCcTc GTCAOGQTGG GTCT6AGATA 

1051 CaiCX3UUUlT ATCTTGGAGA ACTATGCCTG GAACCTGGCT ATQATTGGAG 

1101 CGCTGA3QTC CCCCCTAGCT TACCCAIGGA GAAACTAAXA ATTTACCGCT 

1151 TAAAIGTGAC TCAATTTACA AAGGACAAGT CCAGIAAGCT ACCTGAIGAC 

1201 CTTGCTG6AA CTTTcTcTGG CATTAGCGAA AAATGGCACC ATTTTAAAGA 

1251 TCTTGGTGTG AATGCAATGT TACTGGA6CC AATTTTCCcT TTTGATGAGC 

1301 AGAAAG6ACC CTATTTTCCG TGGCATTTCT TTTCACCTGG AAATATGTAT 

1351 GGACCTTCTG GTGACCCTCT TTCTGCCATT AAATCGATGA A6GATATGGT 

1401 TAAGAAATTA CATGCTAACG GGATAGAGGT TTTTCTTGAA GTTGTTTTCA 

1451 CTCACACTGC AGAGGATGCA CCTTTGATGA ATGTTGATAA CTTTTCATAT 

1501 TGCATAAAAO GTGGTCAGTA TCTGAATATT CAAAATGCAT TGAATTGCAA 
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X551 TTACCCCATA GTCCAACAAA TGATTTTGGA CTGTCTCCGC CACTGGGTAA 

1601 TTGAGTTTCA TATTGATGGT TTTGTTTTTG TCAACGCTTC ri ' CLTi ' G ' ri ' G 

1651 AGAGOGTTCA ATG6AGAGAT TCTATCTCGT CCTCCATTAg TTgaagcTaT 

1701 TGCCTTTGAT CCTATCCTTT CAAAGGTCAA GATGATTGCA GATAATTGGA 

1751 ATCCATTAAC CAATGATTCG AAGGAAAATT TATTCXCTCA CTGGAGGAGA 

1801 TGGGCAGAGA TAAATATGAG ATTTTGTGAT GACATTCGAG ACTTCTTGAG 

1851 AGGCGAgGGT CTTCTAAnCA ATCTAnCAAC ACgACTTTGT GGAAGT6G6G 

1901 ATATCTTCGC A6GTQGACGT GGTCCTGCAT TCTCTTTTAA TTATATTGCC 

1951 AGAAATTCTG GACTCACACT TGTTGACCTA GTTAGCTTCA GTA6TAATGA 

2001 AGTGGCTTCA GAGTTAAlGTT GGAACTGTGG ACAAGAAGGC GCTACGACCA 

2051 ATAACATTGT CCTAGAGAGA CGACTTAAAC AAGTTCGTAA TTTTCTGTTC 

2101 ATATTGTTCA TTTCTCTAGG TGTACCAGTA CTTAACATG6 GAGACGAGTG 

2151 TGGTCAGTCT TCA6GAG6TC CCCCTGCaTa TgATgCTCGA AAATCTTT6G 

2201 GTTGGAATAC TTTAAAAACT GGTTTTGGGA CTCAGATTGC CCAGTTTATT 

2251 TCATTCTTGA GTAATTTAAG AATGAGAAGA AGTGATCTTC TTCAAAAGAG 

2301 AACCTTCTTG AAGGAAGAAA ACATCCAGTG GCATGG6AGT GACCAATCTC 

2351 CTCCGAAATG GGATGGCCCX5 TCTAGCAAAT TCTTGGCTAT 6ACTTTGAAG 

2401 GCCGATGCTG AAGTCAGCCA GACATTAGTC TCTGATATCG TAGGTGACCT 

2451 GTTTGTTGCT TTCAATGGTG CTGGTGATTC AGA6ATTQTT ATCCTTCCAC 

2501 CTCCTCCAAC AGATATG6TA TGGCATCGTC TCGTTGACAC AGCCCTCCCT 

2551 TTCCCGGGGT TTTTCGAT6A GAAGGGAACT CCAGTTGAAQ ATGAATTAGT 

2601 TGCTTATGAG ATGAAGTCTC ACAGCTGTTT GCTGTTTGAA GCTCAGAGAC 

2651 TAGCTGAAAT AGATTCTAGC AAGAGAAAGA AACAGATTAG ACTTTcTTcT 

2701 AAGAG6CAAT AGTTTCTAAA GCCCCTAAGT ATATATATAT GTTTAAATAA 

2751 GAGGCTTTTT TTTCTGAATA AATAAGAAGA TTTTACTGAG AATACTT6TA 

2801 TCTAAACATT TTCTTTTGCA GCTTCAAATA AAAAAAAAAA AAA 
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X JLXM. 1 GGCGGCCGCT CTAGAACTAG TGGATCCCCC GGGCTGCAGG AATTCGAGGA 

51 TCCGGGTACC ATGGCTCAGT CCTTCTCAAT TTCJIGTGCCA CATACTCTAG 

101 ATCACACTCT CTCTCTTCCT CAAAGTTCTC CCATGGAGTT ACTTCATTCt 

151 CCTTCCATTT CTACCTACAA ACCTAAACTC TCTTTCCACA ACCATCTTTT 

201 CTOQAGGAGA A6CAGTAACG GTOTAgATTT T6AGA6TATT TCGAGAAAAT 

251 CGAGGTCTTC AGTGGTTAAT GCTGCTGTTG ATAGTGGACG TGGAGGTGTG 

301 GTGAAGACTC CGGCTACTGC GGTGGTGGTG GAGAAGCCGA CQACGGAACG 

351 ATGTCGTTTG AGGTTTTATC AGGGAAAGCC ATTGCCGTTT GGTGCTACTG 

401 CQACAGATGG TGGTGTGAAT TTCGCTGTTT TTCAAGGAAA IGCTACASCT 

451 GCTACTCTTT GCTTGATCAC TCTTTCCGAT TTACCTGASA AGAGAGTGAC 

501 CGAGCAAATT TTCCTGGATC CTCTAGCTAA TAAAACTGGA GATGTATCGC 

551 ATOTGTTCCT TAAGGGAGAT nTGAGAATA TGCTATATGG CTACAAATTT 

601 GATG6GAAAT TCTGTCClliA AGAAGGACAC TACTTTGACT CTTCGCAGAT 

651 AOTGTTGGAT CCTTATGCCa agGCTATAGT AAGCAGAGGA GAATATGGTG 

701 TATTAGGGCC AGAGGATGAT TGTTGGCCCC CAATGGCTGG CATGGTACCc 

751 TTCT6CTTCT GgATCAGTTT GTATTGGGAA GQA6ATCTAC CACTGgAAGT 

801 TTCCcACAgA QAQATCTTGT TnATCnATGA AATGCATGTT CGTQGGTTTA 

851 CTATCCATGA GTCGAGTGAA ACAAAATATC CTGGTACTrA CCri ' GUi ' Grr 

901 GTGGA6AAAC TTGATCACTT QAAGGAACTT GGTGTCAACT GTATAGAGCT 

951 AATGCCCICT CACGAGTTCA ATGAGCTGGA GTACTATAGT TATAACTCTG 

1001 TATTGGGC6A CTACAACTTT AACnTTGGO GCTATTCTAC TGTCAATTTC 

1051 TTTTCTCCAA TGGGAAGATA CTCATCTOCT GGTCTAAGTA ATTGCGGCCT 

1101 CX3QTGCAAXA AACgAATTTA AGTATCTTGT CAAGGAAGCA CAXAAACGTG 

llSl GAAXCGAgGT TATCATGGAT GTrGTTTTCA ATCACACTGC TGAAOGAAAT 

1201 GAAAATGGTC CCATACTATC ATTTAgAgGC ATTGACAACA GTBTGTTTTA 

1251 TACGCTAGCT CCTAAGGGTg AATTTTACAA CTACTCaGQA TGTGGAAATA 

1301 CCtTCAACTO TAATAATCCC ATTGTACGTC AATtTATAgT GATGCTGAgA 

1351 TATtGGGTTA (XGAAATiGCA CGTaCATGGC TTCCGCTTTG ATCTTGCTTC 

1401 TATCCTTACA AGAAGTAGCA GCTCGTGGAA TGCTGTAAAT OTCTATMAA 

1451 ATTCAATTGA CG6TGACGTO ATCaCCACAG GCACTCCTCT CACAAGCCCA 

1501 CCATTGATTG ATATGATTAG CAATGATCCA ATACTTCGTG GA6TAAARCT 
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1551 TATAGCTGAA GCATGGGATT GTGGAGGCCT TTACCAAGTT GGCATGTTTC 

1601 CGCACTGGGG TATCTGGTCG GAGTGGAACG GAAAGTACCG TGACATGGTA 

1651 CGGCAGTTCA TCAAAGGCAC TGATGGGTTT TCTGGGGCTT TTGCTGAATG 

1701 CCTTTGTGGA A6CCCAAATC TATACCAGAA AGGA6GAAGA AAACCAT6GA 

1751 ACAGTATAAA TTTCX3TGTOT GCCCACGATG GTTTTACTTT GGCTGATTTA 

1801 GTGACATACA ACAATAAACA CAATTTGGCA AATGGAGAGG ACAACAAAGA 

1851 CGGGGAGAAT CACAATAATA GTTGGAATTG TGGTGAGGAA GGAGAATTTG 

1901 CAAGTATCTT TGTGAA6AAA TTGAGGAAAA GACAAATGCG GAACTTCTTC 

1951 cTCTGCcTTA tGGTTTCCCA AGGT G TT CC C ATGATATATA TGGGCXSATOA 

2001 ATAT6GTCAC ACTAAGGGA6 GAAACaACAA CACGtATTGC CATCATAATT- 

2051 ATATTAATTA CTTCCGTTGG GATAAGAAGG ATGAATCTTC ATCTGATTTT 

2101 TTGAGATTTT GCGGCCTCAT GACCAAATTC CGCCATGAAT GTGAATCACT 

2151 GG6ATTA6AT GGTTTCCCTA CAGCAGAAAG GCTGCAATGG CATGGTCACA 

2201 CTCCTAGAAC TCCAGATT06 TCTGAAACAA GTCGATTCGT TGCATTCACA 

2251 CTGGTGGACA AAGTGAAGGG AGAACTATAT ATTGCXTTTTA ACGCCAC3CCA 

2301 TTTGCCTGTA ACGATTACAC TTCCAgATAG QCCTGGTTAT AGATGGCAGC 

2351 CXrrrTGTGGA CACAGGCAAA CCAGCACCAT TTGACTTCTT GACAGACGAC 

2401 GTTCCTGAGA GAGAGACAGC AGCCAAACAA TATTCTCATT TTCTGGACGC 

2451 GAACXAGTAT CCGAIGCTCA GTTATTCATC CATTATTCTT TTACTATCAT 

2501 CT6CTGATGA TGCAXA6TTT CATTCACCAA GTTAOGTGGA OQTAAATCAG 

2551 CTTCAGATTT TGTTATATGC AGTGAGCPTGT TACTTTGTAA ATAAAAOTAA 

2601 GAAGCAGGAC AGMCM3MC TGCAAACG6A TAAAATTTGT 6AGGAA6AAG 

2651 CTGATGATTT ATAAGAtACA CCTTGTaTTt TAATtGCATT TATATAAAAT 

2701 AAAATAnTAG TGAAATTGTc TGTQcGAAAa aaaaaaAAAA AAAAAATAAA 

2751 AAAAAAAAAA AAAAAAAAAA AACCATGGTA CCOGGATCCT CGAATTtiaAT 

2801 ATCAA6 
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1 CTCATTCCGA GGAATAGACA ACAAGGTTTA TTACATGGTA GATTTGAACA 

5X ACAATGCTCA GCTGCTGAAT TTCGCTGGAT GTGGAAATAC TTTTAACTGC 

101 AATCATCCTA CAGTCAT6GA ACTTATACTT GAAAGCTTAA GACACTGGGT 

151 CACCGAGTAT CATGTCGATG GATTTCGCTT TGATCTTGCT AGTGTTCTTT 

201 GCAGAQGGAC AGATGGTACT CCCATTAATG CTCCCCCCCT TGTAAAGGCC 

251 ATTTCCAAAG ATAGTOTATT GTCGAGGTGC AAAATTATTG CTGAGCCATG 

301 GGATTGTGGA GGCCTATATC TTGTTGGAAA GTTTCCGAAC TGGGACCGGT 

351 GGGCTGAGTG GAATG6GAAG TACCGCGATG ACATCAGGAG ATTTATAAAG 

401 GGGGATGCTG GCATGAAAGG AAATTTTGCA ACCCGTATCG CAGGTTCAGC 

451 GiSATCTGTAC AGAGTGAACA AGCGAAAGCC GTACCACAGT GTCAACTTCG 

501 TGATTGCCCA TQATGGCTIT ACCTTGTATG ACCrTGTTTC ATACAATAAT 

551 AAGCACAATG ATGCX3AACGG TGAAGGTGGC AATGATGGAT GCAATGACAA 

601 CTTCAGTTGO AATTGTGGAA TTGAAGGTGA AACTTCAGAT GCAAATATTA 

651 ACGCACTGC6 TTCACGGCAA ATGAAAAATT TTCATTrGGC ACTG A T GG T T 

701 TCTCA6G6AA CACCAATGAT GCTTATGGGG GATGAGTATG GGCATACCCX3 

751 CTATG6AAAT AATAACA6TT ATGGACATGA TACC6CCATC AACAATTTCC 

801 A6TGGG6ACA ATTGGAAGCA AGGAAGAATG ATCACTTCAG GTTCTTTTCC 

851 AAGATGATAA AGTTTCGACT GTCCCACaAt GTTCTTAGAA AGGAaAACTT 

901 CATTGAGAAG AACGACATTA CCTGGCTCGA GGACAACTGG XACAATCAAG 

951 AGAGTA6ATT CCTTGCATTT ATGCTCCATG ATGGGAATGG AGGAGATATT 

1001 TACTTGGCAT TTAATGCACA CCA CrmX rC ATCAAAACAG CAATACCTTC 

1051 ACCACCACGA AATAGAAGTT GGTACCGAGT GGTGGACACT AATCTQAAGT 

1101 CACCAGATGA TTTTOTTATT GAGGGAGTGT CTGGTATCAG TGAAACTTAT 

1151 GATGTTGCGC CXTTACTCTGC TATCCTTCTT GAAGCAAAGC AATAATTACC 

1201 GG6ACTATGC TGCTTTAGAT GTTGTCCATG TGTTATTACA GTATTACCTC 

1251 CTTCTGQATT GGATAGTTCA AATTGGAATT CAGGCTGTTA GCCTATAGAT 

1301 GTAGTAIGTT GAGCAGAAAT TTTGCAATAA GCAACCA6TT TTGTTCAAAA 

1351 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 
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1 LMGIiD*BYVI HQVLYLTLL* HCVLLMATSP IQLAVHSRLL SYGSTBSTKL 

51 VPSSSOIRGK IVCSLRKLBL EDMNFS6IGR MNDQBAPRRA HRRKALSASR 

101 ISLVPSAKRV PTYLFRTDIG GQVKVLVEKT HGKYKVLVEV LPLELSDAHS 

151 blvmvwgi.fr SDALCFMPLO LNRRGADGKS STVBTPFVQG PSGKVTVBLD 



251 ISFQPDGSVN FALFSRSARS WLCLYDDIS VBKPSLBIDL DPYIIIRSGDI 

301 WHAALDCSLP FKrYGYRCKA TTSGKGBLVL LDPYAKVXRR VIPRQGGSBI 

351 RPKYLGBLCL EPGYDWSGDV PPSLPMEKLI IYRI2IVTQFT KDKSSKLPDD 

401 LAGTFSGISE KHHHFKDLGV NAMLLEPIFP FDEQKGPYFP WRFFSPOINY 

451 GPSGDPLSAZ KSNKDHVKKL HANGIEVFLE WFTRTAEDA PLMNVDNFSiy 

501 CIKQGQYLHI QKALNGHYPI VQQNILDCLR HWVIBFHXDG FVFVNASSLL 

551 RGFMGEXLSR PPLVEAIAFD PILSKVKMXA DMHNPLTNDS KENLFPHPIRR 

601 WABZNMRFCD DIRDFLRGE6 UJcNLxTRLC 6SGDIFA6GR GPAFSFNYXA 

651 RNSGLTLVDL VSFSSNBVAS ELSWNCGQEG ATTNNZVLBR RLKQVRNFLF 

701 ILFISL6VPV LC9M6DEOGQS SOGPPAYDAR KSL6HMTLRT 6FGTQIAQFI 

751 SFLSNLRMRR SDLLQKRTFL KBEIIIQHHGS DQSPPRNDGP SSKFLAMTLK 

801 ADABVSQTLV SDZVGDLFVA FNGAGDSEXV ILPPPPTDMV WHRLVDTALP 

851 FP6FFDEK6T PVKDELVAYB MKSHSCLLFE AQRLAEIDSS KRKKQIRLSS 

901 RRQ^FVRPLS lYXCLNKRLF FLHK^EDFTB 1ITCI*TFSFA ASNRKKK 



201 



FKASLAPFYI 



SFYHKEOLVS 



OHBNSEIRSH 



RNTNFWPVG LSSGHPAPL6 
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1 G6RSRTS6SP 6LQBFBDPOT MAQSFSISVP HTLDHTLSLP QSSPMEIOBC 

51 PSISTYKPKL SFHNHLFSRR SSNGVDFESI WRKSRSSWN AAVDSGRGGV 

101 VKTAATAVW EKPTTERCRL RFYQGKPLPF GATATDGGVH FAVFQGNATA 

151 ATLCLITLSD LPEKRVTBQI FLDPLANKTG DVWHVFLKGD FEMMLYGYKF 

201 DGKFCPEEGH YFDSSQIVLD PYAKAIVSR6 EYGVLGPEDD CWPPNAGMVP 

251 FCFWZSLYWE GDLPLEVSHR EILxIxEMBV R6FTIHESSE TfCyPGTYLGV 

301 VBKLDHLKEL 6VIICIBLMPC HEFNELBYYS YHtSVUGDYKF HFW6YSTVNP 

351 FSPMGRYSSA 6LSHCGLGAI NSFKYLVKEA HKRGIEVIMD WFNHTAB6H 

401 EKBPIIiSFRG IDNSVFYTLA PKGEFYNYSG CGNTFNCNNP XVRQFIVMLR 

451 YWVTEMHVRG FRFDLA5XLT RSSSSWNAVN VYGNSIDGDV ITTGTPLTSP 

501 PLIDMISNDP ILRGVKLIAB ANDC6GLYQV GMFPHffGIWS BWNGKYRDMV 

551 RQFZKGTD6F SGAFABCLCG SPHLYQKGGR KPHNSIHFVC AHDGFTLADL 

601 VTYMNKHNLA MGEDMKDGBN HNNSTOC6EB GEFASXFVKK LRKRQMRNFF 

651 LCU1VSQ6VP MXYMGDBYGH TRGOINNTYC HDMYXKYFRW DKKDESSSDF 

701 LRFCGLMTKF RHECBSLGLD GFPTAERLQW HGHTPRTPDW SETSRFVAFT 

751 LVDKVKGELY lAFNASHLPV TITLPDRPGY RWQPFVDTGK PAPFDFLtDD 

801 VPBRETAAXQ YSRFLDANQY PHLSYSSXIL LLSSADDA*F H5PS*VBVMQ 

851 LQXIaLYAVRC YFVKKSKKQD RTBLQTDKXC EEEADDL^DT PCXLXAFX*H 

901 KlatVKLSVRK KRKKKIKXKK KKKKFWYPDP RlxYQ 
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X SFRGIDNKVY YMVDLNNNAQ . LLNFAGCGNT FKCNHPTVME LILESLRHWV 
51 TEYHVDGFRF DLASVLCRGT DGTPINAPPL VKAISKDSVL SRCKIIABPW 
101 DCGGLYLVGK FPNWDRWAEW NGKYRDDIRR FIKGDA01KG NFATRIAGSA 
151 DLYRVNKRKP YHSVNFVIAH DGFTLYDLVS YNNKHNDANG EGGNDGCNDN 
201 FSWMCGIEGE TSDANINALR SRQMKNFHLA LMVSQGTPMM LMGDEYGHTR 
251 YGNNNSYGHD TAINNFQWGQ LEARKNDHFR FFSKMIKFRL SHNVLRKENF 
301 lEKNDITWLE DNWYNEESRF LAFMLHDGNG GDIYLAFNAH HFSIKTAIPS 
351 PPRNRSWYRV VDTNLKSPDD FVIEGVSGIS ETYDVAPYSA ILLEAKQ+LP 
401 GLCCFRCCPC VITVLPPSGL DSSNWNSGC* PIDWC*AEI LQ*ATSFVQK 
451 KKKKJUUCKKK KK 

Fig. 15 

1 GATCATAACT TGAGTTCTAA GCGG 

Fig.UA 



1 CAGGAAACAG CTATGAC 

Fig, 16B 
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